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(1390 REV. 5-93) US DEPT. OF COMMERCE PATENT & TRADEMARK OFFICE 

TRANSMITTAL LETTER TO THE 

UNITED STATES 
DESIGNATED/ELECTED OFFICE 
(DO/EO/US) CONCERNING A FILING 
UNDER 35 U.S.C. 371 


ATTORNEY'S DOCKET NUMBER 
110048 


U.S. APPLICATION NO. 
(if known, sec 37 C.F.R.1 .5) 

09/869927 


INTERNATIONAL APPLICATION NO. 
PCT/FROO/00144 


INTERNATIONAL FILING DATE 
January 21, 2000 


PRIORITY DATE CLAIMED 
January 21, 1999 



TITLE OF INVENTION 

ENDOGENEOUS NUCLEIC ACID FRAGMENT ASSOCIATED WITH AN AUTOIMMUNE DISEASE, LABELING METHOD AND 
REAGENT 



APPLICANT(S) FOR DO/EO/US 

Glaucia PARAHOS-BACCALA, Francois MALLET, Cecile VOISSET 

Applicant herewith submits to the United States Designated/Elected Office (DO/EO/US) the following items and other 
information: 

1 . S3 T his is a FIRST submission of items concerning a filing under 35 U.S.C. 371 . 

2. □ This is a SECOND or SUBSEQUENT submission of items concerning a filing under 35 U.S.C. 371 . 

3. S3 This express request to begin national examination procedures (35 U.S.C. 371(f)) at any time rather than 

delay examination until the expiration of the applicable time limit set in 35 U.S.C. 371(b) and PCT Articles 22 
and 39(1). - - 

4. S3 A proper Demand for International Preliminary Examination was made by the 19th month from the earliest 

claimed priority date. 

5. S3 A copy of the International Application as filed (35 U.S.C. 371 (c)(2)) 

a- D is transmitted herewith (required only if not transmitted by the International Bureau). 

b. S3 has been transmitted by the International Bureau. 

c. □ is not required, as the application was filed in the United States Receiving Office (RO/US) 

6. S3 A translation of the International Application into English (35 U.S.C. 371(c)(2)). 

7. □ Amendments to the claims of the International Application under PCT Article 19 (35 U.S.C. 371(c)(3)) 

a. □ are transmitted herewith (required only if not transmitted by the International Bureau). 

b. □ have been transmitted by the International Bureau. 

c. □ have not been made; however, the time limit for making such amendments has NOT expired. 

d. □ have not been made and will not be made. 

8. □ A translation of the amendments to the claims under PCT Article 19 (35 U.S.C. 371(c)(3)). 

9. □ An oath or declaration of the inventor(s) (35 U.S.C. 371 (c)(4)). 

10. S3 A translation of the annexes to the International Preliminary Examination Report under PCT Article 36 

(35 U.S.C. 371 (c)(5)). 

Items 11. to 16. below concern other document(s) or information included: 

11. □ An Information Disclosure Statement under 37 CFR 1 .97 and 1 .98. 

12. □ An assignment document for recording. A separate cover sheet in compliance with 37 CFR 3.28 and 3.31 is 

included. 

13. S3 A FIRST preliminary amendment 

□ A SECOND or SUBSEQUENT preliminary amendment. 

14. □ A substitute specification. 

15. □ Entitlement to small entity status is hereby asserted. 

16. □ Other items or information: 
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INTERNATIONAL APPLICATION NO. 
'PCT/FROO/00144 



ATTORNEY'S DOCKET NUMBER 
110048 



17. ^ The following fees are submitted: 

Basic National fee (37 CFR 1.492(a)(l)-(5)): 

Search Report has been prepared by the EPO or JPO .... $860.00 

International preliminary examination fee paid to USPTO 

(37 CFR1.482) $690.00 

No international preliminary examination fee paid to USPTO 
(37 CFR 1.482) but international search fee paid to USPTO 
(37 CFR 1.445(a)(2)) $710.00 

Neither international preliminary examination fee (37 CFR 

1.482) nor international search fee (37 CFR 1.445(a)(2)) 

paid to USPTO $1,000.00 

International preliminary examination fee paid to USPTO 
(37 CFR 1 .482) and all claims satisfied provisions of PCT 
Article 33(2)-(4) $ 100.00 

ENTER APPROPRIATE BASIC FEE AMOUNT = 


CALCULATIONS 


PTO USE ONLY 




$860.00 




Surcharge of $130.00 for furnishing the oath or declaration later than 
□ 20 □ 30 months from the earliest claimed priority date (37 CFR 
1.492(e)). 


$ 




Claims 


Number Filed 


Number 
Extra 


Rate 






Total Claims 


18-20 = 


0 


X $ 18.00 


$ 




Independent Claims 


5- 3 = 


2 


X $ 80.00 


$160.00 




Multiple dependent claim(s)(if applicable) 


+ $270.00 


$ 




TOTAL OF ABOVE CALCULATIONS = 


$1020.00 




Reduction by 1/2 for filing by small entity, if applicable. 


$ 




SUBTOTAL = 


$1020.00 




Processing fee of $130.00 for furnishing the English translation later 
than □ 20 □ 30 month from the earliest claimed priority date (37 CFR 
1.492(f)). + 


$ 




TOTAL NATIONAL FEE = 


$1020.00 






Amount to be 
refunded 


$ 


Charged 


$ 



a. [>3 Check No. 120730 in the amount of $ 1020.00 to cover the above fees is enclosed. 

b. □ Please charge my Deposit Account No. in the amount of $ to cover the above fees. A duplicate copy 

of this sheet is enclosed. 

13 The Director is hereby authorized to charge any additional fees which may be required, or credit any overpayment, 
to Deposit Account No. 15-0461 . A duplicate copy of this sheet is enclosed. 



c. 



NOTE: Where an appropriate time limit under 37 CFR 1.494 or 1.495 has not been met, a petition to revive (37 CFR 
1.137(a) or (b)) must be filed and granted to restore the application to pending status. 

SEND ALL CORRESPONDENCE TO: 
Oliff & Berridge, PLC 
P.O. Box 19928 
Alexandria, Virginia 22320 



Date: July 9. 2001 




NAME:/ William P. B§ 
REOfSFRATION NUMBER: 

NAI\|E: Joel S. Armstrong 
REGISTRATION NUMBER: 36,430 



(1390Rev.10-00) 
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IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 
In re the Application of 

Glaucia PARANHOS-BACCALA et al. BOX: SEQUENCE 

Application No. : 09/869,927 

Filed: August 17, 2001 Docket No.: 110048 

For: ENDOGENEOUS NUCLEIC ACID FRAGMENT ASSOCIATED WITH AN 
AUTOIMMUNE DISEASE, LABELING METHOD AND REAGENT 

SUPPLEMENTAL PRELIMINARY AMENDMENT 

Director of the U.S. Patent and Trademark Office 
Washington, D.C. 20231 

Sir: 

In response to the Notification of Missing Requirements mailed August 21, 2001, 
please amend the above-identified patent application as follows: 
IN THE SPECIFICATION : 

At the end of the application, please replace the current Sequence Listing with the 
attached paper and computer-readable Sequence Listing. 

Page 4, line 13 - page 5, line 1 8, delete current paragraph and insert therefor: 
Location of the clones on the reconstructed genomic RNA sequence: 
cl.6A2 (1321 bp) 1-1325; 

cl.PH74 (535+2229= 2764 bp) 72-606 and 5353-7582; 
cl.24.4 (491+1457= 1948 bp); 115-606 and 5353-6810; 
cl.44.4 (2372 bp) 115-2496; 

cl.PH7 (369+297= 666 bp) 237-606 and 7017-7313; cl.6Al (2938 bp) 586-3559; 



-N ^ 
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cl.Pi5T (2785+566= 3351 bp) 2747-5557 and 7017-7582; 
cl.7A16 (1422 bp) 2908-4337; 

cl.Pi22 (317+1689= 2006 bp) 3957-4273 and 4476-6168; 
cl.C4C5 (1 1 16 bp) 6467-7582 
5'LTR 1..120 

/note="R of 5'LTR (5 1 end uncertain)" 

121. .575 

/note="U5 of 5'LTR" 
misc. 579..596 

/note="PBS, primer binding site, for tRNA-W" 
misc. 606 

/note="splice junction (splice donor site ATCCAAAGTG-GTGAGTAATA 

(SEQ ID NO: 32) and splice acceptor site CTTTTTTCAG-ATGGGAAACG 

(SEQ ID NO: 33), clone RG083M05, GenBank accession AC000064)" 
misc. 5353 

/note- 'splice acceptor site for ORF1 (env)" 
misc. 5560 

/note- ' splice donor site" 
ORF 5581..7194 

/note="ORFl env 538 AA" 

/product-="envelope" 
misc. 7017 

/note="splice acceptor site for ORF2 and ORF3" 
ORF 7039..7194 

/note="ORF2 52 AA" 
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ORF 7112..7255 

/note= n ORF3 48 AA M 
misc. 7244..72S4 

/note- TPT, polypurine tract" 
3'LTR 72S6..7582 

/note="U3-R of 3* LTR (U3-R junction undetermined) 
misc. 7S63..7569 

polyadenylation signal 
IN THE CLAIMS : 

Please cancel claims 17 and 18 without prejudice to or disclaimer of the subject matter 
contained therein. 

Please replace claim 12 as follows: 

12. (Amended) Method for studying and/or monitoring T-cell proliferation 
in vitro, according to which the T cells from a patient are brought into contact with synthetic 
peptides belonging to SEQ ID NO. 31. 

Please add new claims 19-31 as follows: 

-19. Method according to claim 1 0, characterized in that the biological sample is a 
biological fluid chosen from serum, plasma, synovial fluid and urine.— 

-20. Method for studying and/or monitoring T-cell proliferation in vitro, according 
to which the T cells from a patient are brought into contact with transcription/translation 
products as obtained according to the method of claim 19.-- 

— 21 . Reagent for detecting, in a biological sample, an autoimmune disease or 
monitoring pregnancy, comprising at least one transcription/translation product as obtained 
according to the method of claim 19.- 
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—22. Reagent for detecting, in a biological sample, an autoimmune disease or 
monitoring pregnancy, comprising at least one synthetic peptide belonging to 
SEQ ID NO: 31.- 

—23 . Reagent for detecting, in a biological sample, an autoimmune disease or 
monitoring pregnancy, comprising at least one protein according to claim 14.-- 

—24. A method for detecting susceptibility to an autoimmune disease or monitoring 
pregnancy of a patient, comprising bringing a biological sample of said patient into contact 
with at least one fragment according to claim 1 .— 

—25. The method of claim 24, wherein said autoimmune disease is multiple 
sclerosis.- 

—26. A method for detecting susceptibility to an autoimmune disease or monitoring 
pregnancy of a patient, comprising bringing a biological sample of said patient into contact 
with at least one transcription/translation product as obtained according to the method of 
claim 19.-- 

—27. The method of claim 26, wherein said autoimmune disease is multiple 
sclerosis.- 

—28. A method for detecting susceptibility to an autoimmune disease or monitoring 
pregnancy of a patient, comprising bringing a biological sample of said patient into contact 
with at least one synthetic peptide belonging to SEQ ID NO: 3 1 .— 

-29. The method of claim 28, wherein said autoimmune disease is multiple 
sclerosis.— 

—30. A method for detecting susceptibility to an autoimmune disease or monitoring 
pregnancy of a patient, comprising bringing a biological sample of said patient into contact 
with at least one protein according to claim 14.— 
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-3 1 . The method of claim 30, wherein said autoimmune disease is multiple 
sclerosis.— 

REMARKS 

Claims 1-16 and 19-31 are pending. Claims 17 and 18 are canceled; claim 12 is 
amended; and claims 19-31 are added herein. 

Claims 19-23 are directed to subject matter deleted by the previous removal of 

multiple dependencies from the claims. Claims 24-31 are directed to subject matter 

corresponding to canceled use claims 17 and 18. Claim 12 is amended to remove an 

-tendency . However, the subject matter deleted from the claim has been 

\* \ ^ n of claim 20. 
-v, 

r copy and computer-readable copy of the Sequence Listing are 
vith 37 C.F.R. §§1.821-1.825. The contents of the paper copy and 
of the Sequence Listing are the same. No new matter is added. 

Support for the information provided in the Sequence Listing can be found in the original 

Sequence Listing and in the specification at page 4. 
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Prompt and favorable consideration on the merits is respectfully requested. 




Registration No. 30,024 

Melanie L. Mealy 
Registration No. 40,085 



WPB:MLM/jca 



Attachments: 

Appendix 

Sequence Listing (paper and computer-readable copies) 



Date: October 22, 2001 

Oliff & BERRIDGE, PLC 
P.O. Box 19928 
Alexandria, Virginia 22320 
Telephone: (703) 836-6400 



DEPOSIT ACCOUNT USE 

AUTHORIZATION 
Please grant any extension 

necessary for entry; 
Charge any fee due to our 
Deposit Account No. 15-0461 
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APPENDIX 

Changes to Specification: 

The Sequence Listing is replaced. 

The following is a marked-up version of the amended paragraph: 

Page 4, line 13 - page 5, line 18: 
Location of the clones on the reconstructed genomic RNA sequence: 
cl.6A2(1321 bp) 1-1325; 

cl.PH74 (535+2229= 2764 bp) 72-606 and 5353-7582; 
cl.24.4 (491+1457= 1948 bp); 115-606 and 5353-6810; 
cl.44.4 (2372 bp) 115-2496; 

cl.PH7 (369+297= 666 bp) 237-606 and 7017-7313; cl.6Al (2938 bp) 586-3559; 
cl.Pi5T (2785+566= 3351 bp) 2747-5557 and 7017-7582; 
cl.7A16 (1422 bp) 2908-4337; 

cl.Pi22 (317+1689= 2006 bp) 3957-4273 and 4476-6168; 
cl.C4C5 (1 116 bp) 6467-7582 
5'LTR 1..120 

/note="R of 5'LTR (5' end uncertainHsie}" 

121..575 

/note="U5 of 5'LTR" 
misc. 579..596 

/note="PBS, primer binding site, for tRNA-W" 



A-1 



Docket No. 110048 



Application No. 09/869,927 



misc. 



misc. 



misc. 



ORF 



misc. 



ORF 



ORF 



misc. 



606 

/note="splice junction (splice donor site ATCCAAAGTG-GTGAGTAATA 
fSEO ID NO: 32) and splice acceptor site CTTTTTTCAG-ATGGGAAACG. 
fSEO ID NO: 33V clone RG083M05, GenBank accession AC000064)" 
5353 

/note-'splice acceptor site for ORF1 (env)" 
5560 

/note-' splice donor site" 
5581..7194 

/note="ORFl env 538 AA" 

/product-="envelope" 

7017 

/note=" splice acceptor site for ORF2 and ORF3" 

7039..7194 

/note="ORF2 52 AA" 

7112..7255 

/note="ORF3 48 AA" 

7244..7254 



/note="PPT, polypurine tract" 
3'LTR 7256..7582 

/note="U3-R of 3' LTR (U3-R junction undetermined) 
misc. 7563.-7569 

polyadenylation signal 
Changes to Claims: 

Claims 17 and 18 are canceled. 
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Claims 19-31 are added. 

The following is a marked-up version of the amended claim: 
12. (Amended) Method for studying and/or monitoring T-cell proliferation 
in vitro, according to which the T cells from a patient are brought into contact with e ith e r 
transcription/translation products, as obtain e d according to the m e thod of claim 1 1 , or 
synthetic peptides belonging to SEQ ID NO. 3 1 . 
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SEQUENCE LISTING 



<110> PARANHO S - BACC ALA r Glaucia 
MALLET, Francois 
VOISSET, Cecile 

<120> ENDOGENEOUS NUCLEIC ACID FRAGMENT ASSOCIATED WITH AN AUTOIMMUNE 
DISEASE, LABELING METHOD AND REAGENT 

<130> 110048 

<140> 09/869,927 
<141> 2001-08-17 

<150> PCT/FR00/00144 
<151> 2000-01-21 

<150> FR 99/00888 
<151> 1999-01-21 

<160> 33 

<170> Patentln version 3.1 

<210> 1 

<211> 1511 

<212> DNA 

<213> Homo sapiens 



<400> 



1 



cctagaacgt 


attctggaga 


attgggacca 


atgtgacact 


cagacgctaa 


gaaagaaacg 


60 


atttatattc 


ttctgcagta 


ccgcctggcc 


acaatatcct 


cttcaaggga 


gagaaacctg 


120 


gcttcctgag 


ggaagtataa 


attataacat 


catcttacag 


ctagacctct 


tctgtagaaa 


180 


ggagggcaaa 


tggagtgaag 


tgccatatgt 


gcaaactttc 


ttttcattaa 


gagacaactc 


240 


acaattatgt 


aaaaagtgtg 


gtttatgccc 


tacaggaagc 


cctcagagtc 


cacctcccta 


300 


ccccagcgtc 


ccctccccga 


ctccttcctc 


aactaataag 


gacccccctt 


taacccaaac 


360 


ggtccaaaag 


gagatagaca 


aaggggtaaa 


caatgaacca 


aagagtgcca 


atattccccg 


420 


attatgcccc 


ctccaagcag 


tgagaggagg 


agaattcggc 


ccagccagag 


tgcctgtacc 


480 


tttttctctc 


tcagacttaa 


agcaaattaa 


aatagaccta 


ggtaaattct 


cagataaccc 


540 


tgacggctat 


attgatgttt 


tacaagggtt 


aggacaatcc 


tttgatctga 


catggagaga 


600 


tataatgtta 


ctactaaatc 


agacactaac 


cccaaatgag 


agaagtgccg 


ctgtaactgc 


660 


agcccgagag 


tttggcgatc 


tttggtatct 


cagtcaggcc 


aacaatagga 


tgacaacaga 


720 


ggaaagaaca 


actcccacag 


gccagcaggc 


agttcccagt 


gtagaccctc 


attgggacac 


780 


agaatcagaa 


catggagatt 


ggtgccacaa 


acatttgcta 


acttgcgtgc 


tagaaggact 


840 


gaggaaaact 


aggaagaagc 


ctatgaatta 


ctcaatgatg 


tccactataa 


cacagggaaa 


900 


ggaagaaaat 


cttactgctt 


ttctggacag 


actaagggag 


gcattgagga 


agcatacctc 


960 


cctgtcacct 


gactctattg 


aaggccaact 


aatcttaaag 


gataagttta 


tcactcagtc 


1020 


agctgcagac 


attagaaaaa 


acttcaaaag 


tctgccttag 


gcccggagca 


gaacttagaa 


1080 


accctattta 


acttggcatc 


ctcagttttt 


tataatagag 


atcaggagga 


gcaggcgaaa 


1140 


cgggacaaac 


gggataaaaa 


aaaaaggggg 


ggtccactac 


tttagtcatg 


gccctcaggc 


1200 


aagcagactt 


tggaggctct 


gcaaaaggga 


aaagctgggc 


aaatcaaatg 


cctaataggg 


1260 


ctggcttcca 


gtgcggtcta 


caaggacact 


ttaaaaaaga 


ttatccaagt 


agaaataagc 


1320 


cgcccccttg 


tccatgcccc 


ttacgtcaag 


ggaatcactg 


gaaggcccac 


tgccccaggg 


1380 


gatgaagata 


ctctgagtca 


gaagccatta 


accagatgat 


ccagcagcag 


gactgagggt 


1440 


gcccggggcg 


agcgccagcc 


catgccatca 


ccctcacaga 


gccccgggta 


tgtttgacca 


1500 


ttgagagcca 


a 
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<210> 2 
<211> 2009 



2 



<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_f eature 
<222> (1940) . . (1940) 



<223> n = 


a or g or 


c or t/u 










<400> 2 
atacgactac 


tatagggcga 


attgggccct 


ctagatgcat 


gctcgagcgg 


ccgccagtgt 


60 


gatggatatc 


tgcagaattc 


gccctttgtc 


cgctgtgctc 


ctgatccagc 


gaggcgccca 


120 


ttgctgctcc 


caattgggct 


aaaggcttgc 


cattgttccc 


acacggctaa 


gtgcccgggt 


180 


tcatcctaat 


tgagctgaac 


actagtcact 


gggttccatg 


gttctcttcc 


atgacccacg 


240 


gcttctaata 


gagctctaat 


actcaccaca 


tggcccaaga 


ttccattcct 


tggaatccgt 


300 


gaggccaaga 


accccaggtc 


agagaacacg 


aggcttgcca 


ccgtcttgga 


agtggcccgc 


360 


cgccatcttg 


ggagctctgg 


gagcaaggac 


cccccagtaa 


cattttggca 


accacaaagg 


420 


gacctccaaa 


gcgatgggaa 


acattccccc 


caaggcaaaa 


acgcccctaa 


gatgtattct 


480 


ggagaattgg 


gaccaatgtg 


acactcagac 


gctaagaaag 


aaacgattta 


tattcttctg 


540 


cagtaccgcc 


tggccacaat 


atcctcttca 


agggagagaa 


acctggcttc 


ctgagggaag 


600 


tataaattat 


aacatcatct 


tacagctaga 


cctcttctgt 


agaaaggagg 


gcaaatggag 


660 


tgaagtgcca 


tatgtgcaaa 


ctttcttttc 


attaagagac 


aactcacaat 


tatgtaaaaa 


720 


gtgtggttta 


tgccctacag 


gaagccctca 


gagtccacct 


ccctacccca 


gcgtcccccc 


780 


cccgactcct 


tcctcaacta 


ataaggaccc 


ccctttaacc 


caaacggtcc 


aaaaggagat 


840 


agacaaaggg 


gtaaacaatg 


aaccaaagag 


tgccaatatt 


ccccgattat 


gccccctcca 


900 


agcagtgaga 


ggaggagaat 


tcggcccagc 


cagagtgcct 


gtaccttttt 


ctctctcaga 


960 


cttaaagcaa 


attaaaatag 


acctaggtaa 


attctcagat 


aaccctgacg 


gctatattga 


1020 


tgtt ttacaa 


gggttaggac 


aatcctttga 


tctgacatgg 


agagatataa 


tgttactact 


1080 


aaat cagaca 


ctaaccccaa 


atgagagaag 


tgccgctgta 


actgcagccc 


gagagtttgg 


1140 


cgatctttgg 


tatctcagtc 


aggtcaacaa 


taggatgaca 


acagaggaaa 


gaacaactcc 


1200 


cacaggccag 


caggcagttc 


ccagtgtaga 


ccctcattgg 


gacacagaat 


cagaacatgg 


1260 



3 



agattggtgc 


cacaaacatt 


tgctaacttg 


cgtgctagaa 


craactaaacra 


aaactaggaa 


1320 


gaagcctatg 


aattactcaa 


tgatgtccac 


tataacacag 


ggaaaggaag 


aaaatcctac 


1380 


tgcttttctg 


gacagactaa 


gggaggcatt 


gaggaagcat 


acctccctgt 


V—-' L-* VJ CI V 


144 0 


tattgaaggc 


caactaatct 


taaaggataa 


gtttat cact 


cagt cage tg 


cagacat t ag 


1500 


aaaaaacttc 


aaaagtctgc 


cttaggcccg 


gagcagaact 


tagaaaccct 




1560 


gcatcctcag 


ttttttataa 


tagagatcag 


gaggagcagg 


egaaaeggga 


caaaegggat 


1620 


aaaaaaaaaa 


qqqqqqqtcc 


actactttag 


tcatggccct 


caggcaagca 


gact t tggag 


1680 


gctctggaaa 


agggaaaagc 


tgggcaaat c 


aaatgcctaa 


t~ac^c^c^ct~c^^'r , 

^— V-J y <wJ 


"t~1 - rT'^rf1~rrr , n' 

< — v — v — ci u \ • y 


1740 


gtctacaagg 


acactttaaa 


aaagattat c 


caacrt"acraaa 

t-i m ^4 <— 1-4. c_a 


V — <— ± V^- 




_L U \J \J 


gccccttacg 


tcaagggaat 


cactggaagg 


cccactgccc 


caggggatga 


agatactctg 


1860 


agtcagaagc 


cattaaccag 


atgatccagc 


agcaggactg 


agggtgcccg 


gggegagege 


1920 


cagcccatgc 


catcaccctn 


acagagcccc 


gggtatgctt 


gaccattgag 


agecaggagg 


1980 


ttaactgtct 


cctggacact 


ggcgcagcc 








2009 



<210> 3 

<211> 1056 

<212> DNA 

<213> Homo sapiens 



<400> 3 
etagaaegta 


ttctggagaa 


ttgggaccaa 


tgtgacactc 


agaegctaag 


aaagaaacga 


60 


tttatattct 


tetgeagtae 


cgcctggcca 


caatatcctc 


ttcaagggag 


agaaacctgg 


120 


cttcctgagg 


gaagtataaa 


ttataacatc 


atcttacagc 


tagacctctt 


ctgtagaaag 


180 


gagggcaaat 


ggagtgaagt 


gccatatgtg 


caaactttct 


tttcattaag 


agacaactca 


240 


caattatgta 


aaaagtgtgg 


tttatgeect 


acaggaagee 


ctcagagtcc 


acctccctac 


300 


cccagcgtcc 


cctccccgac 


tccttcctca 


actaataagg 


accccccttt 


aacccaaacg 


360 


gtccaaaagg 


agatagacaa 


aggggtaaac 


aatgaaccaa 


agagtgccaa 


tattccccga 


420 


ttatgccccc 


tccaagcagt 


gagaggagga 


gaatteggee 


cagecagagt 


gcctgtacct 


480 


ttttctctct 


cagacttaaa 


gcaaattaaa 


atagacctag 


gtaaattctc 


agataaccct 


540 


gaeggctata 


ttgatgtttt 


acaagggtta 


ggacaatcct 


ttgatctgac 


atggagagat 


600 



4 



ataatgttac 


tactaaatca 


gacactaacc 


ccaaatgaga 


gaagtgccgc 


tgtaactgca 


660 


gcccgagagt 


ttggcgatct 


ttggtatctc 


agtcaggcca 


acaataggat 


gacaacagag 


720 


gaaagaacaa 


ctcccacagg 


ccagcaggca 


gttcccagtg 


tagaccctca 


ttcrcrcrar'^r'^ 


7ft n 


gaatcagaac 


atggagattg 


gtgccacaaa 


catttgctaa 


cttgcgtgct 


a cr a a crcra p 1~ ci 


8 4 0 


aggaaaacta 


y y aciy ciciLj ( 


"J" ^ ~f~~ iT ^ "H ""1 

LdLyddLLdC 


t caatgatgt 


ccactataac 


acagggaaag 


900 


gaagaaaatc 


ttactgcttt 


tctggacaga 


ctaagggagg 


cattgaggaa 


gcatacctcc 


960 


ctgtcacctg 


actctattga 


aggccaacta 


atcttaaagg 


ataagtttat 


cactcagtca 


1020 


gctgcagaca 


ttagaaaaaa 


cttcaaaagt 


ctgcct 






1056 



<210> 4 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 4 

cggacatcca aagtgatggg aaacg 

<210> 5 

<211> 26 

<212> DNA 

<213> Homo sapiens 

<400> 5 

ggacaggaaa gtaagactga gaaggc 

<210> 6 

<211> 26 

<212> DNA 

<213> Homo sapiens 

<400> 6 

cctagaacgt attctggaga attggg 



5 



<210> 7 

<211> 26 

<212> DNA 

<213> Homo sapiens 

<400> 7 

tggctctcaa tggtcaaaca tacccg 

<210> 8 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 8 

atgggaaacg ttccccccga g 

<210> 9 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 9 

ggcctaaggc agacttttga a 

<210> 10 

<211> 409 

<212> PRT 

<213> Homo sapiens 

<400> 10 

Met Gly Ser Ser His His His His His 
1 5 



21 



21 



His Ser Ser Gly Leu Val Pro 
10 15 



6 



Arg Gly Ser His Met Ala Ser Met Thr Gly Gly Gin Gin Met Gly Arg 
20 25 30 



lie Met Gly Asn lie Pro Pro Lys Ala Lys Thr Pro Leu Arg Cys lie 
35 40 45 



Leu Glu Asn Trp Asp Gin Cys Asp Thr Gin Thr Leu Arg Lys Lys Arg 
50 55 60 



Phe lie Phe Phe Cys Ser Thr Ala Trp Pro Gin Tyr Pro Leu Gin Gly 
65 70 75 80 



Arg Glu Thr Trp Leu Pro Glu Gly Ser lie Asn Tyr Asn lie lie Leu 
85 90 95 



Gin Leu Asp Leu Phe Cys Arg Lys Glu Gly Lys Trp Ser Glu Val Pro 
100 105 110 



Tyr Val Gin Thr Phe Phe Ser Leu Arg Asp Asn Ser Gin Leu Cys Lys 
115 120 125 



Lys Cys Gly Leu Cys Pro Thr Gly Ser Pro Gin Ser Pro Pro Pro Tyr 
130 135 140 



Pro Ser Val Pro Pro Pro Thr Pro Ser Ser Thr Asn Lys Asp Pro Pro 
145 150 155 160 



Leu Thr Gin Thr Val Gin Lys Glu He Asp Lys Gly Val Asn Asn Glu 
165 170 175 



Pro Lys Ser Ala Asn He Pro Arg Leu Cys Pro Leu Gin Ala Val Arg 
180 185 190 



Gly Gly Glu Phe Gly Pro Ala Arg Val Pro Val Pro Phe Ser Leu Ser 
195 200 205 



Asp Leu Lys Gin He Lys He Asp Leu Gly Lys Phe Ser Asp Asn Pro 
210 215 220 



Asp Gly Tyr He Asp Val Leu Gin Gly Leu Gly Gin Ser Phe Asp Leu 
225 230 235 240 



7 



Thr Trp Arg Asp He Met Leu Leu Leu Asn Gin Thr Leu Thr Pro Asn 
245 250 255 



Glu Arg Ser Ala Ala Val Thr Ala Ala Arg Glu Phe Gly Asp Leu Trp 
260 265 270 



Tyr Leu Ser Gin Val Asn Asn Arg Met Thr Thr Glu Glu Arg Thr Thr 
275 280 285 



Pro Thr Gly Gin Gin Ala Val Pro Ser Val Asp Pro His Trp Asp Thr 
2^0 295 300 



Glu Ser Glu His Gly Asp Trp Cys His Lys His Leu Leu Thr Cys Val 
305 310 315 320 



Leu Glu Gly Leu Arg Lys Thr Arg Lys Lys Pro Met Asn Tyr Ser Met 
325 330 335 



Met Ser Thr He Thr Gin Gly Lys Glu Glu Asn Pro Thr Ala Phe Leu 
340 345 350 



Asp Arg Leu Arg Glu Ala Leu Arg Lys His Thr Ser Leu Ser Pro Asp 
355 360 365 



Ser He Glu Gly Gin Leu He Leu Lys Asp Lys Phe He Thr Gin Ser 
370 375 380 



Ala Ala Asp He Arg Lys Asn Phe Lys Ser Leu Pro Lys Leu Ala Ala 
385 390 395 400 



Ala Leu Glu His His His His His His 
405 



<210> 


11 


<211> 


393 


<212> 


PRT 


<213> 


Homo 


<400> 


11 



Met Ala Ser Met Thr Gly Gly Gin Gin Met Gly Arg He Met Gly Asn 
15 10 15 



8 



lie Pro Pro Lys Ala Lys Thr Pro Leu Arg Cys lie Leu Glu Arg lie 
20 25 30 



Leu Glu Asn Trp Asp Gin Cys Asp Thr Gin Thr Leu Arg Lys Lys Arg 
35 40 45 



Phe He Phe Phe Cys Ser Thr Ala Trp Pro Gin Tyr Pro Leu Gin Gly 
50 55 60 



Arg Glu Thr Trp Leu Pro Glu Gly Ser He Asn Tyr Asn He He Leu 
65 70 75 80 

Gin Leu Asp Leu Phe Cys Arg Lys Glu Gly Lys Trp Ser Glu Val Pro 
85 90 95 



Tyr Val Gin Thr Phe Phe Ser Leu Arg Asp Asn Ser Gin Leu Cys Lys 
100 105 110 



Lys Cys Gly Leu Cys Pro Thr Gly Ser Pro Gin Ser Pro Pro Pro Tyr 
115 120 125 



Pro Ser Val Pro Ser Pro Thr Pro Ser Ser Thr Asn Lys Asp Pro Pro 
130 135 140 



Leu Thr Gin Thr Val Gin Lys Glu He Asp Lys Gly Val Asn Asn Glu 
145 150 155 160 

Pro Lys Ser Ala Asn He Pro Arg Leu Cys Pro Leu Gin Ala Val Arg 
165 170 175 



Gly Gly Glu Phe Gly Pro Ala Arg Val Pro Val Pro Phe Ser Leu Ser 
180 185 190 



Asp Leu Lys Gin He Lys He Asp Leu Gly Lys Phe Ser Asp Asn Pro 
195 200 205 



Asp Gly Tyr He Asp Val Leu Gin Gly Leu Gly Gin Ser Phe Asp Leu 
210 215 220 

Thr Trp Arg Asp He Met Leu Leu Leu Asn Gin Thr Leu Thr Pro Asn 

225 230 235 240 



9 



Glu Arg Ser Ala Ala Val Thr Ala Ala Arg Glu Phe Gly Asp Leu Trp 
245 250 255 



Tyr Leu Ser Gin Ala Asn Asn Arg Met Thr Thr Glu Glu Arg Thr Thr 
260 265 270 



Pro Thr Gly Gin Gin Ala Val Pro Ser Val Asp Pro His Trp Asp Thr 
275 280 285 



Glu Ser Glu His Gly Asp Trp Cys His Lys His Leu Leu Thr Cys Val 
290 295 300 



Leu Glu Gly Leu Arg Lys Thr Arg Lys Lys Pro Met Asn Tyr Ser Met 
305 310 315 320 



Met Ser Thr He Thr Gin Gly Lys Glu Glu Asn Leu Thr Ala Phe Leu 
325 330 335 



Asp Arg Leu Arg Glu Ala Leu Arg Lys His Thr Ser Leu Ser Pro Asp 
340 345 350 



Ser lie Glu Gly Gin Leu He Leu Lys Asp Lys Phe He Thr Gin Ser 
355 360 365 



Ala Ala Asp He Arg Lys Asn Phe Lys Ser Leu Pro Lys Leu Ala Ale 
370 375 380 



Ala Leu Glu His His His His His His 
385 390 



<210> 12 

<211> 26 

<212> DNA 

<213> Homo sapiens 



<400> 12 

etagaaegta ttctggagaa ttggga 

<210> 13 
<211> 20 



10 



<212> DNA 

<213> Homo sapiens 



<400> 13 

cctaaggcag acttttgaag 20 

<210> 14 

<211> 54 

<212> DNA 

<213> Homo sapiens 

<400> 14 

tttggtaata cgactcacta tagggcagcc accatgggaa acgttccccc cgag 54 

<210> 15 

<211> 44 

<212> DNA 

<213> Homo sapiens 

<400> 15 

tttttttttt tttttttttc aggctgcgcc agtgtccagg agac 44 

<210> 16 

<211> 20 

<212> DNA 

<213> Homo sapiens 



<400> 16 

tgtccgctgt gctcctgatc 20 

<210> 17 
<211> 44 
<212> DNA 



11 



<213> Homo sapiens 



<400> 17 

tttttttttt tttttttttc aggctgcgcc agtgtccagg agac 44 

<210> 18 

<211> 678 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<222> (594) . . (594) 

<223> n = a or g or c or t/u 



<400> 18 
tcagggatag 


cccccatcta 


tttggccagg 


cattagccca 


agacttgagc 


cagttctcat 


60 


acctggatat 


tcttgtcctt 


tggtatgcgg 


atgatttact 


tttagccgcc 


cgttcagaaa 


120 


ccttgtgcca 


tcaagccacc 


caagtgctct 


taaatttcct 


cgccacctgt 


ggctacaagg 


180 


tttccaaacc 


aaaggctcag 


ctctgctcac 


agcagaaggc 


tatttaccct 


aaatacttag 


240 


ggctgaaatt 


atccaaaggc 


accagggccc 


tcagtgagga 


atgtatccag 


cctatactgg 


300 


cttatcctta 


tcccaaaacc 


ctaaaacaac 


taagaaggtt 


ccttggcata 


ataggcataa 


360 


caggcataac 


aggtttctgc 


tgaatatgga 


ttcccaagta 


cggcaaaata 


gccagaccat 


420 


tatatacact 


aattaaggaa 


actcagaaag 


ccaataccca 


tttagtaaga 


tggacacctg 


480 


aagcagaggc 


agctttccag 


gccgtaaaga 


acaccctaac 


ccaagcccca 


gtgttaagct 


540 


tgccagcggg 


gcaagacttt 


tctttctgtg 


tcacagaaaa 


aataggaata 


gctntaggag 


600 


tccttacaca 


ggtccgaggg 


accagcttgc 


aacccatggc 


atacctgagt 


aaggaaattg 


660 


atgtagtggc 


aaagggtt 










678 



<210> 19 
<211> 591 



12 



<212> DNA 

<213> Homo sapiens 



<400> 19 
ccatggccat 


ctacactgaa 


caagatttat 


acaatcatgt 


cgtacctaag 


ccccacaaca 


60 


aaagagtacc 


cattcttcct 


tttgttatca 


gagcaggagt 


gctaggcaga 


ctaggtactg 


120 


gcattggcag 


tatcacaacc 


tctactcagt 


tctactacaa 


actatctcaa 


gaaataaatg 


180 


gtgacatgga 


acaggtcact 


gactccctgg 


tcaccttgca 


agatcaactt 


aactccctag 


240 


cagcagtagt 


ccttcaaaat 


cgaagagctt 


tagacttgct 


aaccgccaaa 


agagggggaa 


300 


cctgtttatt 


tttaggagaa 


gaacgctgtt 


attatgttaa 


tcaatccaga 


attgtcactg 


360 


agaaagttaa 


agaaattcga 


gatcgaatac 


aatgtagagc 


agaggagctt 


caaaacaccg 


420 


aacgctgggg 


cctcctcagc 


caatggatgc 


cctgggttct 


ccccttctta 


ggacctctag 


480 


cagctctaat 


attgttactc 


ctctttggac 


cctgtatctt 


taacctcctt 


gttaagtttg 


540 


tctcttccag 


aattgaagct 


gtaaagctac 


agatggtctt 


acaaatctag 


a 


591 



<210> 20 

<211> 1321 

<212> DNA 

<213> Homo sapiens 



<400> 20 
caacaatcgg 


gatataaacc 


caggcattcg 


agctggcaac 


agcagccccc 


ctttgggtcc 


60 


cttccctttg 


tatgggagct 


gttttcatgc 


tatttcactc 


tattaaatct 


tgcaactgca 


120 


ctcttctggt 


ccatgtttct 


tacggctcga 


gctgagcttt 


tgctcaccgt 


ccaccactgc 


180 


tgtttgccac 


caccgcagac 


ctgccgctga 


ctcccatccc 


tctggatcct 


gcagggtgtc 


240 


cgctgtgctc 


ctgatccagc 


gaagcgccca 


ttgccgctcc 


caattgggct 


aaaggcttgc 


300 


cattgttcct 


gcacggctaa 


gtgcctgggt 


ttgttctaat 


tgagctgaac 


actagtcact 


360 


gggttccatg 


gttctcttct 


gtgacccacg 


gcttctaata 


gaactataac 


acttaccaca 


420 


tggcccaaga 


ttccattcct 


tggaatccgt 


gaggccaaga 


actccaggtc 


agagaatacg 


480 


aagcttgcca 


ccatcttgga 


agcggcctgc 


taccatcttg 


gaagtggttc 


accaccatct 


540 


tgggagctct 


gtgagcaagg 


accccccggt 


aacattttgg 


caaccacgaa 


cggacatcca 


600 
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aagtgatggg 


aaacgttccc 


cgcaagacaa 


aaacgcccct 


aagacgtatt 


ctggaaaatt 


660 


gggaacaatt 


tgaccctcag 


acactaagaa 


agaaacgact 


tatattcttc 


1" crca n't" nrrn 


720 


cctggcactc 


ctgagggaag 


tataaattat 


aacaccatct 


tacagctaga 


cctcttttgt 


780 


agaaaaggca 


aatggagtga 


agtgccataa 


gtacaaactt 


tcttttcatt 


aagagacaac 


840 


tcacaattat 


gtaaaaagtg 


tgatttatgc 


cctacaggaa 


gccttcagag 


t ct acctccc 


900 


tatcccagca 


tccccgactc 


cttccccact 


taataaggac 


cccccttcaa 


p r* c a a a t - a crt" 


960 


ccaaaaggag 


atagacaaaa 


gggtaaacag 


tgaaccaaag 


agtgccaata 


tt rrrraaf f 

O O tLA a. L- 


102 0 


atgacccctc 


caagcagtgg 


gaggaagaga 


attcaaccca 


crr'r'acran'1" nr 


ai~rrtn'r , r*i~t"l~ 


1 ORO 


ttctctccca 


gacttaaagc 


aaataaaaac 


agacttaggt 


aaattctcag 


ataaccctga 


1140 


tggctatatt 


ggtgttttac 


aagggttagg 


acaattcttt 


gatctgacat 


ggagagatat 


1200 


atatgtcact 


gctaaatcag 


acactaaccc 


caaatgagag 


aagtgccacc 


ataactgcag 


1260 


cctgagagtt 


tggcgatctc 


tggtatctca 


gtcaggtcaa 


tgataggatg 


acaacagagg 


1320 


a 
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<210> 21 

<211> 2938 

<212> DNA 

<213> Homo sapiens 



<400> 21 
caacgacgga 


catccaaagt 


gatgggaaac 


gttccccgca 


agacaaaaac 


gcccctaaga 


60 


cgtattctgg 


agaattggga 


ccaatttgac 


cctcagacac 


taagaaagaa 


acgacttata 


120 


ttcttctgca 


gtgccgcctg 


gcactcctga 


gggaagtata 


aattataaca 


ccatcttaca 


180 


gctagacttc 


ttttgtagaa 


aaggcaaatg 


gagtgaagtg 


ccataagtac 


aaactttctt 


240 


ttcattaaga 


gacaactcac 


aattatgtaa 


aaagtgtgat 


ttatgcccta 


caggaagcct 


300 


tcagagtcta 


cctccctatc 


ccagcatccc 


cgactccttc 


cccaactaat 


aaggaccccc 


360 


cttcaaccca 


aatggtccaa 


aaggagatag 


acaaaagggt 


aaacagtgaa 


ccaaagagtg 


420 


ccaatattcc 


ccaattatga 


cccctcccaa 


gcagtgggag 


gaagagattc 


ggcccagcca 


480 


gagtgcatgt 


gctttttctt 


ctcccagact 


taaagcaaat 


aaaaacagac 


ttaggtaaat 


540 


tctcagataa 


tcctgatggc 


tatattgatg 


ttttacaagg 


gttaggacaa 


ttctttgatc 


600 
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tgacatggag agatataatg tcactgctaa atcagacact aaccccaaat gagagaagtg 660 

ccaccataac tgcagcctga gagtttggcg atctctggta tctcagtcag gtcaatgata 720 

ggatgacaac agaggaaaga gatgatcccc acagccagca agcagttccc agtctasacc 780 

ctcattgggg acacagaaat cagtaacatg ggagattggt gctgcagaca tttgctaact 840 

tgtgtgctac aaggactaag gaaaactacg aagaaaatct acgaattact caatgatgtc 900 

caccataaca caggggaagg gaagaaaatc ctactgcctt tctggagaga ctaagggagg 960 

cattgaggaa gcgtgcctct ctgtcacctg actcttctga aggccaacta atcttaaagc 1020 

gtaagtttat cactcagtca gctgcagaca ttagaaaaaa cttcaaaagt ctgccgtagg 1080 

cccggagcaa aacttagaaa ccctattgaa cttggcaacy tcggtttttt ataatagaga 1140 

tcaggaggag caggcggaac aggacaaacg ggattaaaaa aaaggccacc gctttagtca 1200 

tgaccctcag gcaagtggac tttggaggct ctggaaaagg gaaaagctgg gcaaattgaa 1260 

tgcctaatag ggcttgcttc cagtgcggtc tacaaggaca ctttaaaaaa gattgtccaa 1320 

gtagaagtaa gccgcccctt cgtccatgcc ccttatttca agggaatcac tggaaggccc 1380 

actgccccag gggacaaagg tcttttgagt cagaagccac taaccagatg atccagcagc 1440 

aggactgagg gtgcctgggg caagcgccat cccatgccat caccctcaca gagccctggg 1500 

tatgcttgac cattgagggc caggaaggtt gtctcctgga cactggtgcg gtcttcttag 1560 

tcttactctt ctgtcccgga caactgtcct ccagatctgt cactatctga gggggtccta 1620 

agacgggcag tcactagata cttctcccag ccactaagtt atgactgggg agctttattc 1680 

ttttcacatg cttttctaat tatgcttgaa agccccacta ccttgttagg gagagacatt 1740 

ctagcaaaag caggggccat tatacacctg aacataggag aaggaacacc cgtttgttgt 1800 

cccctgcttg aggaaggaat taatcctgaa gtctgggcaa cagaaggaca atatggacga 1860 

gcaaagaatg cccgtcctgt tcaagttaaa ctaaaggatt ccacttcctt tccctaccaa 1920 

aggcagtacc ccctcagacc caaggcccaa caaggattcc aaaagattgt taaggactta 1980 

aaagcccaag gcttagtaaa accatgcata actccctgca gtaattccgt agtggattga 2040 

ggaggcacag aaacccagtg gacagtggag ggttagtgca agatctcagg attatcaatg 2100 

gaggccgttg tccttttata cccagctgta cctagccctt atactgtgct ttcccaaata 2160 

ccagaggaag cagagtggtt tacactcctg gaccttaagg atgccttctt ctgcatccct 2220 

gtacatcctg actctcaatt cttgtttgcc tttgaagata cttcaaaccc aacatctcaa 2280 

ctcacctgga ctgttttacc ccaagggttc agggatagcc cccatctatt tggccaggca 2340 



15 



ttagcccaag acttgagcca atcctcatac ctggacactt gtccttcggt aggtggatga 2400 

tttacttttg gccgcccatt cagaaacctt gtgccatcaa gccacccaag cgctcttcaa 2460 

tttcctcgct acctgtggct acatggtttc caaaccaaag gctcaactct gctcacagca 2520 

ggttacttag ggctaaaatt atccaaaggc accagggccc tcagtgagga acacatccag 2580 

cctatactgg cttatcctca tcccaaaacc ctaaagcaac taaggggatt ccttggcgta 2 640 

ataggtttct gccgaaaatg gattcccagg tttggcgaaa tagccaggtc attaaataca 2700 

ctaattaagg aaactcagaa agccaatacc catttagtaa gatggacaac tgaagtagaa 2760 

gtggctttcc aggccctaac ccaagcccca gtgttaagtt tgccaacagg gcaagacttt 2820 

tcttcatatg tcacagaaaa aacaggaata gctctaggag tccttacaca gatccgaggg 2880 

atgagcttgc aacctgtggc gtacctgact aaggaaattg atgtagtggc aaagggtt 2938 

<210> 22 

<211> 1422 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<222> (879) . . (879) 

<223> n = a or g or c or t/u 



<220> 

<221> misc_f eature 

<222> (1200) . . (1200) 

<223> n = a or g or c or t/u 



<400> 22 

tcagggatag cccccatcta tttggccagg 
acctggacac tcttgtcctt cagtatgtgg 
ccttgtgcca tcaagccacc caagcactct 



cattagccca agacttgagt cagttatcat 60 
atgatttact tttagctgcc tgttcagaaa 120 
taaatttcct cgccacctgt ggctacaagg 180 
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tttccaaaga 


gaagctcagc 


tctgctcaca 


gcaggttaaa 


tacttaggac 


taagattatc 


240 


caaaggcacc 


aaggccctca 


gtgaggaatg 


tatccagcct 


atactggctt 


atcctcatct 


300 


caaaacccta 


aagcaactaa 


gagagttcct 


tggcataaca 


ggcttctgcc 


gaatatggat 


360 


tccccaggta 


tggcaaaata 


gccaggccat 


tatatacagt 


aattaaggaa 


actcagaaag 


420 


ccaataccca 


tttaataaga 


tggatacctg 


aagccaaagt 


ggctttccag 


gcccct aaag 


480 


aaggccttaa 


acccaagtcc 


cagtgttaag 


cttgccaacg 


gggcaagact 


tttctttata 


540 


catcacagaa 


aaaaacagaa 


acagctctgg 


gagtccttac 


acaggtccaa 


gggacgagct 


600 


tgcaacccat 


ggcatacctg 


agtaaggaaa 


ctgatgtagt 


ggcaaagggt 


tggcttcatt 


660 


gtttatgggt 


agtggtggca 


gtagcaqttq 


tagtatctga 


agcagttaaa 


ataatapaerrr 

± L>p Oi. (_■! L_* C*t \*J \Jk 


720 


ggagagatct 


tactgtgtgg 


acatctcatg 


aggtgaacag 


catactcact 


gctaaaggag 


780 


acttgtggct 


gtcagacaac 


cgtttactta 


aatatcaggc 


tctattactt 


gaaaggccag 


840 


tgctgcaact 


gtgcacttgt 


gcaactctta 


acccagtcnc 


attt cttcca 


gacaatgaag 


900 


atagaatata 


actgtcaaca 


aataatttct 


caaacctatg 


ccactcgagg 


ggaccttcta 


960 


gaagttccct 


tgactgatcc 


tgaccttcaa 


cttgtatact 


gatggaagtt 


cctttcrtacr^ 


1020 


aaaaggactt 


caaaagcggg 


gtatgcagtg 


gtcagtgata 


atggaatatt 


tgaaagtatc 


1080 


ccctcactcc 


aggaactagt 


gcttagctgg 


cagaactaat 


agcctt catt 


csnncicpiC'tpin 
:d y — — ' y 


1 1 40 

-L -L *3 \J 


aattaggaga 


aggaaaaagg 


gtaaatatat 


atacagactc 


tgagtatgct 


c a cc tag ten 


1200 


tccatgccca 


tgaggcaata 


tgcagagaaa 


gggaattcct 


aacttccgag 


ggaacaccta 


1260 


tcacacatca 


ggaagccatt 


aggagattat 


tactggcagt 


acagaaacct 


aaagaggtgg 


1320 


aagtcttaca 


ctgctggggt 


catcagaaag 


gaaagaaaag 


ggaaatagaa 


gggaattgee 


1380 


aagcagatat 


tgaagcaaaa 


agagctgcaa 


ggcaggaccc 


tc 




1422 



<210> 23 

<211> 2006 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc feature 



17 



<222> (305) . . (305) 

<223> n = a or g or c or t/u 



<400> 23 
atgcagtggt 


cagtgataat 


ggaatacttg 


aaagtaatcc 


cctcactcca 


ggaactagtg 


60 


ctcagctagc 


agaactaata 


gccctcactt 


gggcactaga 


attaggagaa 


gaaaaaaggg 


120 


caaatatata 


tacagactct 


aaatatgctt 


acctagtcct 


ccatgcccat 


gcagcaatat 


180 


ggaaagaaag 


ggaattccta 


acttctgaga 


gaacacctat 


caaacatcag 


gaagccatta 


240 


ggaaattatt 


attggctgta 


cagaaaccta 


aagaggtggc 


agtcttacac 


tgccggggtc 


300 


atcanaaagg 


aaaggaaagg 


gaaaatactt 


ttgcctgcaa 


ctatccaatg 


gaaattactt 


360 


aaaacccttc 


atcaaacctt 


tcacttaggc 


atcgatagca 


cccatcaaat 


ggccaaatca 


420 


ttatttactg 


gaccaggcct 


tttcaaaact 


atcaagcaaa 


tattcagggc 


ctgtgaattg 


480 


tgccaaaaaa 


ataatcccct 


gcctcatcgc 


caagctcctt 


caggaaaaca 


aaaaacaggc 


540 


cattaccctg 


aaaaaaactg 


gcaactgatt 


ttacccacaa 


gcccaaacct 


cagggatttc 


600 


agtatctact 


agtctgggta 


aatactttca 


cgggttgggc 


aaaggccttc 


ccctgtagga 


660 


cagaaaaggc 


ccaagaggta 


ataaaggcac 


tagttcatga 


aataattccc 


agattcggac 


720 


ttccccgagg 


cttacagagt 


gacaatagcc 


ctgctttcca 


ggccacagta 


acccagggag 


780 


tatcccaggc 


gttaggtata 


cgatatcact 


tacactgcgc 


ctgaaggcca 


cagtcctcag 


840 


ggaaggtcga 


gaaaatgaat 


gaaatactca 


aaggacatct 


aaaaaagcaa 


acccaggaaa 


900 


cccacctcac 


atggcctgct 


ctgttgccta 


tagccttaaa 


aagaatctgc 


aactttcccc 


960 


aaaaagcagg 


acttagccca 


tacgaaatgc 


tgtatggaag 


gcccttcata 


accaatgacc 


1020 


ttgtgcttga 


cccaagacag 


ccaacttagt 


tgcagacatc 


acctccttag 


ccaaatatca 


1080 


acaagttctt 


aaaacattac 


aaggaaccta 


tccctgagaa 


gagggaaaag 


aactattcca 


1140 


cccttgtgac 


atggtattag 


tcaagtccct 


tctctctaat 


tccccatccc 


tagatacatc 


1200 


ctgggaagga 


ccctacccag 


tcattttatt 


taccccaact 


gcggttaaag 


tggctggagt 


1260 


ggtcttggat 


acatcacact 


tgagtcaaat 


cctggatact 


gccaaaggaa 


cctgaaaatc 


1320 


caggagacaa 


cgctagctat 


tcctgtgaac 


ctctagagga 


tttgcgcctg 


ctcttcaaac 


1380 


aacaaccagg 


aggaaagtaa 


ctaaaatcat 


aaatccccca 


tggccctccc 


ttatcatatt 


1440 


tttctcttta 


ctgttctttt 


accctctttc 


actctcactg 


caccccctcc 


atgccgctgt 


1500 


atgaccagta 


gctcccctta 


ccaagagttt 


ctatggagaa 


tgcagcgtcc 


cggaaatatt 


1560 
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gatgccccat cgtataggag tctttctaag ggaaccccca ccttcactgc ccacacccat 1620 

atgccccgca actgctatca ctctgccact ctttgcatgc atgcaaatac tcattattgg 1680 

acaggaaaaa tgattaatcc tagttgtcct ggaggacttg gagtcactgt ctgttggact 1740 

tacttcaccc aaactggtat gtctgatggg ggtggagttc aagatcaggc aagagaaaaa 1800 

catgtaaaag aagtaatctc ccaactcacc cgggtacatg gcacctctag ccctacaaag 1860 

gactagatct ctcaaaacta catgaaaccc tccgtaccca tactcgcctg gtaagcctat 1920 

ttaataccac cctcactggg ctccatgagg tctcggccca aaaccctact aactgttgga 1980 

tatgcctccc cctgaacttc aagcca 2006 

<210> 24 

<211> 1948 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<222> (84) . . (84) 

<223> n = a or g or c or t/u 

<220> 

<221> misc_feature 

<222> (193) . . (193) 

<223> n = a or g or c or t/u 

<220> 

<221> mi sc_f eature 

<222> (241) . . (241) 

<223> n = a or g or c or t/u 
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<400> 24 





actgcactct 


tctggtccat 


gtttcttacg 


gctcgagctg 


agcttttgct 


caccgtccac 


60 




cactgctgtt 


tgccaccacc 


gcanacctgc 


cgctgactcc 


catccctctg 


gatcctgcag 


120 




ggtgtccgct 


gtgctcctga 


tccagcgagg 


cgcccattgc 


cgctcccaat 


tgggctaaag 


180 




gcttgccatt 


gtncctgcac 


ggctaagtgc 


ctgggtttgt 


tctaattgag 


ctgaacacta 


240 




ntcactgggt 


tccatggttc 


tcttctgtga 


cccacggctt 


ctaatagaac 


tataacactt 


300 




accacatggc 


ccaagattcc 


attccttgga 


atccgtgagg 


gcaagaactc 


caggtcagag 


360 




aatacgaggc 


ttgccaccat 


cttggaagcg 


gcctgctacc 


atcttggaag 


tggtt caeca 


420 




ccatcttggg 


agctctgtga 


gcaaggaccc 


cccggtaaca 


ttttggcaac 


cacgaaegga 


480 




catccaaagt 


gatacatcct 


gggaaggacc 


ctacccagtc 


attttatcta 


ccccaactgc 


540 




ggttaaagtg 


gctggagtgg 


agtcttggat 


acatcacact 


tgagtcaaat 


cctggatact 


600 




gccaaaggaa 


cctgaaaatc 


caggagacaa 


cgctagctat 


tcctgtgaac 


ctctagagga 


660 


m 


tttgcgcctg 


ctcttcaaac 


aacaaccagg 


aggaaagtaa 


ctaaaatcat 


aaat ccccat 


720 




ggccctccct 


tatcatattt 


ttctctttac 


tgttgtttca 


ccctctttca 


ctct cactgc 


780 




accccctcca 


tgccgctgta 


tgaccagtag 


ctccccttac 


caagagtttc 


tatggagaat 


840 




gcagcgtccc 


ggaaatattg 


atgccccatc 


gtataggagt 


ctttgtaagg 


gaacccccac 


900 




cttcactgcc 


cacacccata 


tgccccgcaa 


ctgctatcac 


tctgccactc 


tttgeatgea 


960 


LiL- 


tgcaaatact 


cattattgga 


caggaaaaat 


gattaatcct 


agttgtcctg 


gaggactt gg 


102 0 




agtcactgtc 


tgttggactt 


acttcaccca 


aactggtatg 


tctqatcrqaa 


gtggagttca 


1080 




agatcaggca 


agagaaaaac 


atgtaaaaga 


agtaatctcc 


caactcaccc 


acrcrtacatcrrT 


1140 




cacctctagc 


ccctacaaag 


gactagatct 


ctcaaaacta 


catgaaaccc 


tccotarrra 

\M \^ C4. V^r C+. 


1200 




tactcgcctg 


gtaagcctat 


ttaataccac 


cctcactggg 


ctccatgagg 


t ct cggccca 


12 60 




aaaccctact 


aactgttgga 


tatgcctccc 


cctgaacttc 


aggccatatg 


t tt caat ccc 


1320 




tgtacctgaa 


caatggaaca 


acttcagcac 


agaaataaac 


accacttccg 


ttttacrt^cTfT 

■ — - * — ■ — — - c* y m 






acctcttgtt 


tccaatctgg 


aaataaccca 


tacctcaaac 


ct cacct crt" a 


l— Cl Cl Cl d L- L- Ldy 






caatactaca 


tacacaacca 


actcccaatg 


catcaggtgg 


gtaactcctc 


v — v — • Cl v ci ■ Cl Cl CL L» 


i son 

1JUU 




3 r~T ~\~ r~* ~\~ e"r /~* ~\~ ^ 
clLj LL Ly LLLd 


ccc t caggaa 


4- — t -i— -I— J I 1— 4-— ,4— 

rat: Tit tttgt 


ctgtggtacc 


tcagcctatc 


gttgtttgaa 


1560 




tggctcttca 


gaatctatgt 


gcttcctctc 


attcttagtg 


ccccctatgg 


ccatctacac 


1620 




tgaacaagat 


ttatacagtt 


atgtcatatc 


taagccccgc 


aacaaaagag 


tacccattct 


1680 




tccttttgtt 


ataggagcag 


gagtgctagg 


tgcactaggt 


actggcattg 


geggtatcac 


1740 
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aacctctact cagttctact acaaactatc tcaagaacta aatggggaca tggaacgggt 1800 

cgccgactcc ctggtcacct tgcaagatca acttaactcc ctagcagcag tagtccttca 18 60 

aaatcgaaga gctttagact tgctaaccgc tgaaagaggg ggaacctgtt tatttttagg 1920 

ggaagaatgc tgttattatg ttaatcaa 1948 

<210> 25 

<211> 1136 

<212> DNA 

<213> Homo sapiens 



<400> 25 
ccatggccat 


ctacactgaa 


caagatttat 


acagttatgt 


catatctaag 


ccccgcaaca 


60 


aaagagtacc 


cattcttcct 


tttgttatag 


gagcaggagt 


gctaggtgca 


ctaggtactg 


120 


gcattggcgg 


tatcacaacc 


tctactcagt 


tctactacaa 


actatctcaa 


gaactaaatg 


180 


gggacatgga 


acgggtcgcc 


gactccctgg 


tcaccttgca 


agatcaactt 


aactccctag 


240 


cagcagtagt 


ccttcaaaat 


cgaagagctt 


tagactcgct 


aaccgctgaa 


agagggggaa 


300 


cctgtttatt 


tttaggggaa 


gaatgctgtt 


attatgttaa 


tcaatccgga 


atcgtcactg 


360 


agaaagttaa 


agaaattcga 


gatcgaatac 


aacgtagagc 


agaagagctt 


cgaaacactg 


420 


gaccctgggg 


cctcctcagc 


caatggatgc 


cctggattct 


ccccttctta 


ggacctctag 


480 


cagctataat 


attgctactc 


ctctttggac 


cctgtatctt 


taacctcctt 


gttaactttg 


540 


tctcttccag 


aatcgaagct 


gtaaaactac 


aaatggagcc 


caagatgcag 


tccaagacta 


600 


agatctaccg 


cagacccctg 


gaccggcctg 


ctagcccacg 


atctgatgtt 


aatgacatca 


660 


aaggcacccc 


tcctgaggaa 


atctcagctg 


cacaacctct 


actacgcccc 


aattcagcag 


720 


gaagcagtta 


gagcggtcgt 


cggccaacct 


ccccaacagc 


acttaggttt 


tcctgttgag 


780 


atgggggact 


gagagacagg 


actagctgga 


tttcctaggc 


tgactaagaa 


tccctaagcc 


840 


tagctgggaa 


ggtgaccaca 


tccaccttta 


aacacggggc 


ttgcaactta 


gttcacacct 


900 


gaccaatcag 


agagctcact 


aaaatgctaa 


ttaggcaaag 


acaggaggta 


aagaaatagc 


960 


caatcatcta 


ttgcatgaga 


gcacagcagg 


agggacaatg 


atcgggatat 


aaacccaagt 


1020 


cttcgagccg 


gcaacggcaa 


ccccctttgg 


gtcccctccc 


tttgtatggg 


agctctgttt 


1080 


tcatgctatt 


tcactctatt 


aaatcttgca 


gctgcgaaaa 


aaaaaaaaaa 


aaaaaa 


1136 
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<210> 26 

<211> 2782 

<212> DNA 

<213> Homo sapiens 



<400> 26 
atgggagctg 


ttttcatgct 


atttcactct 


attaaatctt 


gcaactgcac 


tcttctggtc 


60 


catgtttctt 


acggctcgag 


ctgagctttt 


gctcaccgtc 


caccactgct 


gtttgccacc 


120 


accgcagacc 


tgccgctgac 


tcccatccct 


ctggatcctg 


cagggtgtcc 


gctgtgctcc 


180 


tgatccagcg 


aagcgcccat 


tgccgctccc 


aattgggcta 


aaggcttgcc 


attgttcctg 


240 


cacggctaag 


tgcctgggtt 


tgttctaatt 


gagctgaaca 


ctagtcactg 


ggttccatgg 


300 


ttctcttctg 


tgacccacgg 


cttctaatag 


aactataaca 


cttaccacat 


ggcccaagat 


360 


tccattcctt 


ggaatccgtg 


aggccaacga 


actccaggtc 


agagaatacg 


aagcttgcca 


420 


ccatcttgga 


agcggcctgc 


taccatcttg 


gaagtggttc 


accaccatct 


tgggagctct 


480 


gtgagcaagg 


accccccggt 


gacattttgg 


cgaccaccaa 


cggacatccc 


aagtgataca 


540 


tcctgggaag 


gaccctaccc 


agtcatttta 


tctaccccaa 


ctgcggttaa 


agtggctgga 


600 


gtggagtctt 


ggatacatca 


cacttgagtc 


aaatcctgga 


tactgccaaa 


ggaacctgaa 


660 


aatccaggag 


acaacgctag 


ctattcctgt 


gaacctctag 


aggatttgcg 


cctgctcttc 


720 


aaacaacaac 


caggaggaaa 


gtaactaaaa 


tcataaatcc 


ccatgggcct 


cccttatcat 


780 


atttttctct 


gtagtgttct 


ttcaccctgt 


ttcactctca 


ctgcaccccc 


tccatgccgc 


840 


tgtatgacca 


gtagctcccc 


tcacccagag 


tttctatgga 


gaatgcagcg 


tcccggaaat 


900 


attgatgccc 


catcgtatag 


gagtctttct 


aagggaaccc 


ccaccttcac 


tgcccacacc 


960 


catatgcccc 


gcaactgcta 


tcactctgcc 


actctttgca 


tgcatgcaaa 


tactcattat 


1020 


tggacaggaa 


aaatgattaa 


tcctagttgt 


cctggaggac 


ttggagtcac 


tgtctgttgg 


1080 


acttacttca 


cccaaactgg 


tatgtctgat 


gggggtggag 


ttcaagatca 


ggcaagagaa 


1140 


aaacatgtaa 


aagaagtaat 


ctcccaactc 


accggggtac 


atggcacctc 


tagcccctac 


1200 


aaaggactag 


atctctcaaa 


actacatgaa 


accctccgta 


cccatactcg 


cctggtaagc 


1260 


ctatttaata 


ccaccctcac 


tgggctccat 


gaggtctcgg 


cccaaaaccc 


tactaactgt 


1320 


tggatatgcc 


tccccctgaa 


cttcaggcca 


tatgtttcaa 


tccctgtacc 


tgaacaatgg 


1380 
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aacaacttca 


gcacagaaat 


aaacaccact 


tccgttttag 


taggacctct 


tgtttccaat 


1440 


gtggaaataa 


cccatacctc 


aaacctcacc 


tgtgtaaaat 


ttagcaatac 


t aca tacaca 


1500 


accaactccc 


aatgcatcag 


gtgggtaact 


cctcccacac 


aaatagtctg 


cct accct ca 


1560 


ggaatatttt 


ttgtctgtgg 


tacctcagcc 


tatcgttgtt 


tcraatcrcr^"^~r• 


1~'l _ (~ , arr3p'hr , 'l _ 

c i — v ci y ci a L- l. 




atgtgcttcc 


tctcattctt 


agtgccccct 


atgaccatct 


acactgaaca 


W ^-4 W L-« Ci CI ^ 


J_ \J O \J 


agttatgtca 


tatctaagcc 


ccgcaacaaa 


agagtaccca 


ttcttccttt 


tcrttatanrra 


1740 


gcaggagtgc 


taggtgcact 


aggtactggc 


attggcggta 


tcacaacctc 


tact cagttc 


1800 


tactacaaac 


tatctcaaga 


actaaatggg 


gacatggaac 


gggt cgccga 


ctr , r , r , 1 _ n , rrtr' 

^-^ ^— V-j 


18 60 


accttgcaag 


atcaacttaa 


ctccctagca 


gcagtagtcc 


ttcgaaat eg 


aacracfr , tt~i~a 

LJ. O ^ O CI 


192 0 


gacttgctaa 


ccgctgagag 


agggggaacc 


tgtttatttt 


t a qct cr cr a a cr a 


atcrclrrtt^t" 


X _? o u 


tatgttaatc 


aatccggaat 


cgtcactgag 


aaagttgaag 


aaat t ccaga 


tccraatacaa 

v — - CA CA d. n CI CA 


2 04 0 


cgtatagcag 


aggagcttcg 


aaacactgga 


ccctggggcc 


tcctcagccg 


atggatgccc 


2100 


tggattctcc 


ccttcttagg 


acctctagca 


gctataatat 


tgctactcct 


ctttggaccc 


2160 


tgtatctttg 


acctccttgt 


taactttgtc 


tcttccagaa 


tcgaagctgt 


gaaact acaa 


2220 


atggagccca 


agatgcagtc 


caagactaag 


atctaccgca 


gacccctgga 


occincct'cict' 

' — * > — y >. — \ — i_- y * — • L~ 




agcccacgat 


ctgatgttaa 


tgacatcaaa 


ggcacccctc 


c t era cr era a a t 




? ^ 4 n 


caacctctac 


tacgccccaa 


ttcagcagga 


agcagttaga 


qcqcr tcrat ca 


g cca a cct cc 


2400 


ccaacagcac 


ttaggttttc 


ctgttgagat 


gqqqqactqa 


gagacaggac 


t acrct" a era 1 t~ 


24 60 


tcctaggctg 


actaagaatc 


cttaagccta 


qqtqqqaaqq 


tgaccacatc 


carrttf a^fl 

*■ — ca * — . \ — • c- c o ci ci a 




cacggggctt 


gcaacttagc 


tcacacctga 


ccaatcagag 


agct cact aa 


ca d L, y I l_ CI CL L, l_ 


9 S R 0 
o o u 


aggcaaagac 


aggaggtaaa 


gaaatagcca 


atcatttatt 


gectgagage 


acagcaggag 


2640 


ggacaatgat 


cgggatataa 


acccaagttt 


tcgagccggc 


aacggcaacc 


ccctttgggt 


2700 


cccctccctt 


tgtatgggag 


ctctgttttc 


atgctatttc 


actctattaa 


atettgeaac 


2760 


tgcaaaaaaa 


aaaaaaaaaa 


aa 
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<210> 27 

<211> 666 

<212> DNA 

<213> Homo sapiens 
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<220> 

<221> misc_feature 

<222> (119) . . (119) 

<223> n = a or g or c or t/u 



<400> 27 

tgtccgctgt gctcctgatc cagcgaggcg cccattgccg ctcccaattg ggctaaaggc 60 

ttgccattgt tcctgcacgg ctaagtgcct gggtttgttc taattgagct gaacactant 120 

cactgggttc catggttctc ttctgtgacc cacggcttct aatataacta taacacttac 180 

cacatggccc aagattccat tccttggaat ccgtgaggcc aagaactcca ggtcagagaa 240 

tacgaggctt gccaccatct tggaagcggc ctgctaccat cttggaagtg gttcaccacc 300 

atcttgggag ctctgtgagc aaggaccccc cggtaacatt ttggcaacca cgaacggaca 360 

tccaaagtga atcgaagctg taaaactaca aatggagccc aagatgcagt ccaagactaa 420 

gatctaccgc agacccctgg accggcctgc tagcccacga tctgatgtta atgacatcaa 480 

aggcacccct cctgaggaaa tctcagctgc acaacctcta ctacgcccca attcagcagg 540 

aagcagttag agcggtcgtc ggccaacctc cccaacagca cttaggtttt cctgttgaga 600 

tgggggactg agagacagga ctagctggat ttcctaggct gactaagaat ccctaagcct 660 

agctgg 666 

<210> 28 

<211> 3372 

<212> DNA 

<213> Homo sapiens 



<400> 28 

gacttcccaa ataccagagg aagcagagtg gtttacagtc ctggaccttc aggatgcctt 60 

cttctgcatc cctgtacatc ctgactctca attcttgttt gcctttgaag atacttcaaa 120 

cccagcatct caactcacct ggactatttt accccaaggg ttcagggata gtccccatct 180 

atttggccag gcattagccc aagacttgag ccaatcctca tacctggaca cttgtccttc 240 

ggtaggtgga tgatttactt ttggccgccc attcagaaac cttgtgccat caagccaccc 300 

24 



aagcgctctt caatttcctc gctacctgtg gctacatggt ttccaaacca aaggctcaac 360 

tctgctcaca gcaggttact tagggctaaa attatccaaa ggcaccaggg ccctcagtga 420 

ggaacacatc cagcctatac tggcttatcc tcatcccaaa accctaaagc aactaagggg 480 

attccttggc gtaataggtt tctgccgaaa atggattccc aggtatggcg aaatagccag 540 

gtcattaaat acactaatta aggaaactca gaaagccaat acccatttag taagatggac 600 

aactgaagta gaagtggctt tccaggccct aacccaagcc ccagtgttaa gtttgccaac 660 

agggcaagac ttttgttcat atgtcacaga aaaaacagga atagctctag gagtccttac 720 

acagatccga gggatgagct tgcaacctgt ggcacacctg actaaggaaa ttgatgtagt 780 

ggcaaagggt tgacctcatt gtttacgggt agtggtggca gtagcagtct tagtatctga 840 

agcagttaaa ataatacagg gaagagatct tactgtgtgg acatctcatg atgtgaatgg 900 

catactcact gctaaaggag acttgtggct gtcagacaac tgtttactta aatgtcaggc 960 

tctattactt gaagggccag tgctgcgact gtgcacttgt gcaactctta acccagccac 1020 

atttcttcca gacaatgaag aaaagataaa acataactgt caacaagtaa tttctcaaac 1080 

ctatgccact cgaggggacc ttttagaggt tcctttgact gatcccgacc tcaacttgta 1140 

tactgatgga agttcctttg tagaaaaagg acttcgaaaa gtggggtatg cagtggtcag 1200 

tgataatgga atacttgaaa gtaatcccct cactccagga actagtgctc agctagcaga 1260 

actaatagcc ctcacttggg cactagaatt aggagaagaa aaaagggcaa atataataca 1320 

gactctaaat atgcttacct agtcctccat gcccatgcag caatatggaa agaaagggaa 1380 

ttcctaactt ctgagagaac acctatcaaa catcaggaag ccattaggaa attattattg 1440 

gctgtacaga aacctagaga ggtggcagtc ttacactgcc ggggtcatca caaaggaaag 1500 

gaaagggaaa tacaagagaa ctgccaagca tatattgaag ccaaaagagc tgcaaggcag 1560 

gaccctccat tagaaatgct tattaaactt cccttagtat agggtaatcc cttccgggaa 1620 

accaagcccc agtactcagc aggagaaaca gaatggggaa cctcacgagg cagttttctc 1680 

ccctcgggac ggttagccac tgaagaaggg aaaatacttt tgcctgcaac tatccaatgg 1740 

aaattactta aaacccttca tcaaaccttt cacttaggca tcgatagcac ccatcagatg 1800 

gccaaatcat tatttactgg accaggcctt ttcaaaacta tcaagcagat agtcagggcc 1860 

tgtgaagtgt gccagagaaa taatcccctg ccttatcgcc aagctccttc aggagaacaa 1920 

agaacaggcc attaccctgg agaagactgg caactgattt tacccacaag cccaaacctc 1980 

agggatttca gtatctacta gtctgggtag atactttcac gggttgggca gaggccttcc 2040 
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cctgtaggac agaaaaggcc caagaggtaa taaaggcact agttcatgaa ataattccca 2100 

gattcggact tccccgaggc ttacagagtg acaatagccc tgctttccag gccacagtaa 2160 

cccagggagt atcccaggcg ttaggtatac gatatcactt acactgcgcc tgaaggccac 2220 

agtcctcagg gaaggtcgag aaaatgaatg aaacactcaa aggacatcta aaaaagcaaa 2280 

cccaggaaac ccacctcaca tggcctgttc tgttgcctat agccttaaaa agaatctgca 2340 

actttcccca aaaagcagga cttagcccat acgaaatgct gtatggaagg cccttcataa 2400 

ccaatgacct tgtgcttgac ccaagacagc caacttagtt gcagacatca cctccttagc 2460 

caaatatcaa caagttctta aaacattaca aggaacctat ccctgagaag aggaaaagaa 2520 

tattccaccc aagtgacatg gtattagtca agtcccttcc ctctaattcc ccatccctag 2580 

atacatcctg ggaaggaccc tacccagtca ttttatctac cccaactgcg gttaaagtgg 2640 

ctggagtgga gtcttggata catcacactt gagtcaaatc ctggatactg ccaaaggaac 2700 

ctgaaaatcc aggagacaac gctagctatt cctgtgaacc tctagaggat ttgcgcctgc 2760 

tcttcaaaca acaaccagga ggaaaaatcg aagctgtaaa actacaaatg gagcccaaga 2820 

tgcagtccaa gactaagatc taccgcagac ccctggaccg gcctgttagc ccacgatctg 2880 

atgttaatga catcaaaggc acccctcctg aggaaatctc agctgcacaa cctctactac 2940 

gccccaattc agcaggaagc agttagagcg gtcgtcggcc aacctcccca acagcactta 3000 

ggttttcctg ttgagatggg ggactgagag acaggactag ctggatttcc taggctgatt 3060 

aagaatccct aagcctagct gggaaggtga ccacatccac ctttaaacac ggggcttgca 3120 

acttagctca cacctgacca atcagagagc tcactaaaat gctaattagg caaagacagg 3180 

aggtaaagaa atagccaatc atttattgcc tgagagcaca gcaggaggga caatgatcgg 3240 

gatataaacc caagttttcg agccggcaac ggcaaccccc tttgggtccc ctccctttgt 3300 

atgggagctc tgttttcatg ctatttcact ctattaaatc ttgcaactgc aaaaaaaaaa 3360 

aaaaaaaaaa aa 3372 

<210> 29 

<211> 2372 

<212> DNA 

<213> Homo sapiens 
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<220> 

<221> misc_feature 

<222> (1191) . . (1191) 

<223> n = a or g or c or t/u 

<220> 

<221> misc_feature 

<222> (1213) . . (1213) 

<223> n = a or g or c or t/u 

<220> 

<221> misc_f eature 

<222> (2089) . . (2089) 

<223> n = a or g or c or t/u 

<220> 

<221> misc_f eature 

<222> (2274) . . (2274) 



<223> n = 


a or g or < 


c or t/u 










<400> 29 
actgcactct 


tctggtccat 


gtttcttacg 


gctcgagctg 


agcttttgct 


caccgtccac 


60 


cactgctgtt 


tgccaccacc 


gcagacctgc 


cgctgactcc 


catccctctg 


gatcctgcag 


120 


ggtgtccgct 


gtgctcctga 


tccagcgagg 


cgcccattgc 


cgctcccaat 


tgggctaaag 


180 


gcttgccatt 


gttcctgcac 


ggctaagtgc 


ctgggtttgt 


tctaattgag 


ctgaacacta 


240 


atcactgggt 


tccatggttc 


tcttctgtga 


cccacggctt 


ctaatagaac 


tataacactt 


300 


accacatggc 


ccaagattcc 


attccttgga 


atccgtgagg 


ccaagaactc 


caggtcagag 


360 


aatacgaggc 


ttgccaccat 


cttggaagcg 


gcctgctacc 


gtcttggaag 


tggttcacca 


420 


ccatcttggg 


agctctgtga 


gcaaggaccc 


cccggtaaca 


ttttggcaac 


caacgacgga 


480 


catccaaagt 


gatgggaaac 


gttccccgca 


agacaaaaac 


gcccctaaga 


cgtattctgg 


540 
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agaattggga ccaatttgac cctcagacac taagaaagaa acgacttata ttcttctgca 600 

gtgccgcctg gcactcctga gggaagtata aattataaca ccatcttaca gctagacctc 660 

ttttgtagaa aaggcaaatg gagtgaagtg ccataagtac aaactttctt ttcattaaga 720 

gacaactcac aattatgtaa aaagtgtgat ttatgcccta caggaagcct tcagagtcta 780 

cctccctatc ccagcatccc cgactccttc cccaactaat aaggaccccc cttcaaccca 840 

aatggtccaa aaggagatag acaaaagggt aaacagtgaa ccaaagagtg ccaatattcc 900 

ccaattatga cccctccaag cagtgggagg aagagaattc ggcccagcca gagtgcatgt 960 

gcctttttct ctcccagact taaagcaaat aaaaacagac ttaggtaaat tctcagataa 1020 

ccctgatggc tatattgatg ttttacaagg gttaggacaa ttctttgatc tgacatggag 1080 

agatataatg tcactgctaa atcagacact aaccccaaat gagagaagtg ccaccataac 1140 

tgcagcctga gggtttggcg tctctggtat ctcagtcagg tcaatggata nggatgacaa 1200 

cagaaggaaa ganaatgatt ccccacaggc cagcaggcag ttcccagtct agaccctcat 1260 

tgggacacag aatcagaaca tggagattgg tgctgcagac atttgctaac ttgtgtgcta 1320 

gaaggactaa ggaaaactag gaagaagtct atgaattact caatgatgtc caccataaca 1380 

cagggaaggg aagaaaatcc tactgccttt ctggagagac taagggaggc attgaggaag 1440 

cgtgcctctc tgtcacctga ctcttctgaa ggccaactaa tcttaaagcg taagtttatc 1500 

actcagtcag ctgcagacat tagaaaaaac ttcaaaagtc tgccgtaggc ccggagcaaa 1560 

acttagaaac cctattgaac ttggcaacct cggtttttta taatagagat caggaggagc 1620 

aggcggaaca ggacaaacgg gattaaaaaa aaggccaccg ctttagtcat gaccctcagg 1680 

caagtggact ttggaggctc tggaaaaggg aaaagctggg caaattgaat gcctaatagg 1740 

gcttgcttcc agtgcggtct acaaggacac tttaaaaaag attgtccaag tagaagtaag 1800 

ccgccccttc gtccatgccc cttatttcaa gggaatcact ggaaggccca ctgccccagg 1860 

ggacaaaggt cttttgagtc agaagccact aaccagatga tccagcagca ggactgaggg 1920 

tgcctggggc aagcgccatc ccatgccatc accctcacag agccctgggt atgcttgacc 1980 

attgagggcc aggaaggttg tctcctggac actggtgcgg tcttcttagt cttactcttc 2040 

tgtcccggac aactgtcctc cagatctgtc actattctga gggggtccnt aagacgggca 2100 

gtcactagat actttttccc agccactaag ttatgaactg gggagcttta ttcttttcac 2160 

atgcttttct aattatgctt gaaagcccca ctaccttgtt agggagagac attctagcaa 2220 

aagcaggggc cattatacac ctgaacatag gagaaggaac acccgtttgt tgtncccctg 2280 
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cttgaggaag gaattaatcc tgaagtctgg gcaacagaag gacaatatgg acgagccaaa 2340 
gaatgcccgt cctgttcaag ttaaactaaa gg 2372 

<210> 30 

<211> 7582 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<222> (198) . . (198) 

<223> n = a or g or c or t/u 

<220> 

<221> misc_feature 

<222> (307) . . (307) 

<223> n = a or g or c or t/u 

<220> 

<221> misc_f eature 

<222> (355) . . (355) 

<223> n = a or g or c or t/u 

<220> 

<221> mi sc_f eature 

<222> (1309) . . (1309) 

<223> n = a or g or c or t/u 

<220> 
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<221> 
<222> 
<223> 



misc_f eature 
(1331) . . (1331) 
n = a or g or c or t/u 



<220> 

<221> misc_f eature 

<222> (2213) . . (2213) 

<223> n = a or g or c or t/u 

<220> 

<221> misc_feature 

<222> (2398) . . (2398) 

<223> n = a or g or c or t/u 

<220> 

<221> misc_feature 

<222> (3787) . . (3787) 

<223> n = a or g or c or t/u 

<220> 

<221> misc_feature 

<222> (4115) . . (4115) 

<223> n = a or g or c or t/u 

<220> 

<221> misc_feature 

<222> (4261) . . (4261) 

<223> n = a or g or c or t/u 



<400> 30 

caacaatcgg gatataaacc caggcattcg agctggcaac agcagccccc ctttgggtcc 60 

cttccctttg tatgggagct gttttcatgc tatttcactc tattaaatct tgcaactgca 120 

ctcttctggt ccatgtttct tacggctcga gctgagcttt tgctcaccgt ccaccactgc 180 

tgtttgccac caccgcanac ctgccgctga ctcccatccc tctggatcct gcagggtgtc 240 

cgctgtgctc ctgatccagc gargcgccca ttgccgctcc caattgggct aaaggcttgc 300 

cattgtncct gcacggctaa gtgcctgggt ttgttctaat tgagctgaac actantcact 360 

gggttccatg gttctcttct gtgacccacg gcttctaata kaactataac acttaccaca 420 

tggcccaaga ttccattcct tggaatccgt gaggscaacg aactccaggt cagagaatac 480 

gargcttgcc accatcttgg aagcggcctg ctaccrtctt ggaagtggtt caccaccatc 540 

ttgggagctc tgtgagcaag gaccccccgg tracattttg gcraccamsr acggacatcc 600 

maagtgatgg gaaacgttcc ccgcaagaca aaaacgcccc taagacgtat tctggaraat 660 

tgggamcaat ttgaccctca gacactaaga aagaaacgac ttatattctt ctgcagtgcc 720 

gcctggcact cctgagggaa gtataaatta taacaccatc ttacagctag acytcttttg 780 

tagaaaaggc aaatggagtg aagtgccata agtacaaact ttcttttcat taagagacaa 840 

ctcacaatta tgtaaaaagt gtgatttatg ccctacagga agccttcaga gtctacctcc 900 

ctatcccagc atccccgact ccttccccam ytaataagga ccccccttca acccaaatgg 960 

tccaaaagga gatagacaaa agggtaaaca gtgaaccaaa gagtgccaat attccccaat 1020 

tatgacccct cccaagcagt gggaggaaga gaattcggcc cagccagagt gcatgtgcyt 1080 

tttyytctcc cagacttaaa gcaaataaaa acagacttag gtaaattctc agataaycct 1140 

gatggctata ttgrtgtttt acaagggtta ggacaattct ttgatctgac atggagagat 1200 

atatatgtca ctgctaaatc agacactaac cccaaatgag agaagtgcca ccataactgc 1260 

agcctgagrg tttggcgatc tctggtatct cagtcaggtc aatggatang gatgacaaca 1320 

gaaggaaaga naatgattcc ccacaggcca gcargcagtt cccagtctas accctcattg 1380 

gggacacaga aatcagtaac atgggagatt ggtgctgcag acatttgcta acttgtgtgc 1440 

tasaaggact aaggaaaact asgaagaaar tctaygaatt actcaatgat gtccaccata 1500 

acacagggga agggaagaaa atcctactgc ctttctggag agactaaggg aggcattgag 1560 

gaagcgtgcc tctctgtcac ctgactcttc tgaaggccaa ctaatcttaa agcgtaagtt 1620 

tatcactcag tcagctgcag acattagaaa aaacttcaaa agtctgccgt aggcccggag 1680 

caaaacttag aaaccctatt gaacttggca acytcggttt tttataatag agatcaggag 1740 
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gagcaggcgg aacaggacaa acgggattaa aaaaaaggcc accgctttag tcatgaccct 1800 

caggcaagtg gactttggag gctctggaaa agggaaaagc tgggcaaatt gaatgcctaa 1860 

tagggcttgc ttccagtgcg gtctacaagg acactttaaa aaagattgtc caagtagaag 1920 

taagccgccc cttcgtccat gccccttatt tcaagggaat cactggaagg cccactgccc 1980 

caggggacaa aggtcttttg agtcagaagc cactaaccag atgatccagc agcaggactg 2040 

agggtgcctg gggcaagcgc catcccatgc catcaccctc acagagccct gggtatgctt 2100 

gaccattgag ggccaggaag gttgtctcct ggacactggt gcggtcttct tagtcttact 2160 

cttctgtccc ggacaactgt cctccagatc tgtcactatt ctgagggggt ccntaagacg 2220 

ggcagtcact agatacttty tcccagccac taagttatga actggggagc tttattcttt 2280 

tcacatgctt ttctaattat gcttgaaagc cccactacct tgttagggag agacattcta 2340 

gcaaaagcag gggccattat acacctgaac ataggagaag gaacacccgt ttgttgtncc 2400 

cctgcttgag gaaggaatta atcctgaagt ctgggcaaca gaaggacaat atggacgagc 2460 

caaagaatgc ccgtcctgtt caagttaaac taaaggattc cacttccttt ccctaccaaa 2520 

ggcagtaccc cctcagaccc aaggcccaac aaggattcca aaagattgtt aaggacttaa 2580 

aagcccaagg cttagtaaaa ccatgcataa ctccctgcag taattccgta gtggattgag 2640 

gaggcacaga aacccagtgg acagtggagg gttagtgcaa gatctcagga ttatcaatgg 2700 

aggccgttgt ccttttatac ccagctgtac ctagccctta tactgtgmyt tcccaaatac 2760 

cagaggaagc agagtggttt acastcctgg accttmagga tgccttcttc tgcatccctg 2820 

tacatcctga ctctcaattc ttgtttgcct ttgaagatac ttcaaaccca rcatctcaac 2880 

tcacctggac trttttaccc caagggttca gggatagycc ccatctattt ggccaggcat 2940 

tagcccaaga cttgagycar tymtcatacc tggacactct tgtccttcrg takgtggatg 3000 

atttactttt rgcygccyrt tcagaaacct tgtgccatca agccacccaa gcrctcttma 3060 

atttcctcgc yacctgtggc tacawggttt ccaaacsara rgctcarctc tgctcacagc 3120 

aggttaaata cttaggrcta arattatcca aaggcaccar ggccctcagt gaggaayrya 3180 

tccagcctat actggcttat cctcatcyca aaaccctaaa gcaactaagr grrttccttg 3240 

gcrtaayagg yttctgccga awatggattc cccaggtwtg gcraaatagc caggycatta 3300 

watacastaa ttaaggaaac tcagaaagcc aatacccatt tartaagatg gayamctgaa 3360 

gymraagtgg ctttccaggc ccctaaagaa ggccttaaac ccaagyccca gtgttaagyt 3420 

tgccaacrgg gcaagacttt tsttyatayr tcacagaaaa aaacagraay agctctrgga 3480 
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gtccttacac agrtccragg gaygagcttg caaccyrtgg cryacctgas taaggaaayt 3540 

gatgtagtgg caaagggttg rcytcattgt ttaygggtag tggtggcagt agcagtykta 3600 

gtatctgaag cagttaaaat aatacagggr agagatctta ctgtgtggac atctcatgak 3660 

gtgaayrgca tactcactgc taaaggagac ttgtggctgt cagacaacyg tttacttaaa 3720 

trtcaggctc tattacttga arggccagtg ctgcractgt gcacttgtgc aactcttaac 3780 

ccagycncat ttcttccaga caatgaagaa aagataraay ataactgtca acaartaatt 3840 

tctcaaacct atgccactcg aggggacctt ytagargttc cyttgactga tccygacctt 3900 

caacttgtat actgatggaa gttcctttgt agaaaaagga cttcgaaaag yggggtatgc 3960 

agtggtcagt gataatggaa tayttgaaag taatcccctc actccaggaa ctagtgctya 4020 

gctrgcagaa ctaatagccy tcaytkgggc actagaatta ggagaagraa aaagggyaaa 4080 

tatatataca gactctrart atgctyacct agtcntccat gcccatgmrg caatatgsar 4140 

agaaagggaa ttcctaactt cygagrgaac acctatcama catcaggaag ccattaggar 4200 

attattaytg gcwgtacaga aacctaraga ggtggmagtc ttacactgcy ggggtcatca 4260 

naaaggaaag raaagggaaa tasaagrgaa ytgccaagca katattgaag cmaaaagagc 4320 

tgcaaggcag gaccctccat tagaaatgct tattaaactt cccttagtat agggtaatcc 4380 

cttccgggaa accaagcccc agtactcagc aggagaaaca gaatggggaa cctcacgagg 4440 

cagttttctc ccctcgggac ggttagccac tgaagaaggg aaaatacttt tgcctgcaac 4500 

tatccaatgg aaattactta aaacccttca tcaaaccttt cacttaggca tcgatagcac 4560 

ccatcaratg gccaaatcat tatttactgg accaggcctt ttcaaaacta tcaagcarat 4620 

aktcagggcc tgtgaaktgt gccararaaa taatcccctg cctyatcgcc aagctccttc 4680 

aggaraacaa araacaggcc attaccctgr araaractgg caactgattt tacccacaag 4740 

cccaaacctc agggatttca gtatctacta gtctgggtar atactttcac gggttgggca 4800 

raggccttcc cctgtaggac agaaaaggcc caagaggtaa taaaggcact agttcatgaa 48 60 

ataattccca gattcggact tccccgaggc ttacagagtg acaatagccc tgctttccag 4920 

gccacagtaa cccagggagt atcccaggcg ttaggtatac gatatcactt acactgcgcc 4980 

tgaaggccac agtcctcagg gaaggtcgag aaaatgaatg aaayactcaa aggacatcta 5040 

aaaaagcaaa cccaggaaac ccacctcaca tggcctgytc tgttgcctat agccttaaaa 5100 

agaatctgca actttcccca aaaagcagga cttagcccat acgaaatgct gtatggaagg 5160 

cccttcataa ccaatgacct tgtgcttgac ccaagacagc caacttagtt gcagacatca 5220 
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cctccttagc 


caaatatcaa 


caagttctta 


aaacattaca 


aggaacctat 


ccctgagaag 


5280 


agggaaaaga 


actattccac 


ccwwgtgaca 


tggt attagt 


caacrtccctt 


f Vrt Ct fi ri 1" t" 

> ' V ^ — ^ O v ■ C» Cl Cl C> 


5340 


ccccat ccct 


agatacatcc 


t gggaaggac 


cctacccagt 


cattttatvt 


arrrraartff 

ci v—- > — ■ v — \ — ci ci u l, y 


5400 


cggttaaagt 


gactacraata 


gagtcttgga 


tacatcacac 


ttgagtcaaa 


tcctcrcratsr' 


5460 


tgccaaagga 


acctgaaaat 


ccaggagaca 


acgctagcta 


tt cctgtgaa 


cct ctagagg 


5520 


atttgcgcct 


gctcttcaaa 


caacaaccag 


gaggaaagta 


actaaaat ca 


taaatccccc 


5580 


atggscctcc 


cttatcatat 


ttttctctkt 


astgttsttt 


varrrt 1 1~ t 

y c-a t_» o ^ 


rartrfrarl" 
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5640 


gcaccccctc 


cat gccgctg 


tatgaccagt 


agct ccccty 


accmagagt t 


tctatcrcracra 


5700 


atgcagcgtc 


ccggaaatat 


tgatgcccca 


tcgtatagga 


gtcttt staa 


gggaaccccc 


5760 


acctt cactg 


cccacaccca 


tatgccccgc 


aactgctatc 


actctgccac 


tptttrrp^tn 
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5820 


catgcaaata 


ctcattattg 


gacaggaaaa 


atgatt aatc 


ctagttgt cc 


tcraacrcract t 
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5880 


ggagtcactg 


tctgttggac 


ttacttcacc 


caaact ggta 


tcrtctcr^tcfcr 
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caagatcagg 


caagagaaaa 


acatgtaaaa 


craacrtaatct 
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uuuu 


ggcacctcta 


gcccctacaa 


aggactagat 


ctct caaaac 
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60 60 


catactcgcc 


tggtaagcct 


atttaatacc 


accct cactg 
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6120 


caaaacccta 


ctaactgttg 


gatatgcctc 


cccctgaact 


l— > — Cl J- y >— - *w Cl C CI 
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61 RO 


cctgtacctg 


aacaatggaa 


caacttcagc 


acagaaataa 
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6940 


ggacct cttg 


tttccaatst 


ggaaataacc 
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agcaatacta 


catacacaac 


caactcccaa 
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63 60 


atagt ctgcc 


taccctcagg 


aatatttttt 


gt ctgtggta 


cct cagccta 


tcattattta 


6420 


aatggctctt 


cagaatctat 


gtgcttcctc 


tcattcttag 
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64ft 0 


actga acaag 


atttatacag 


ttatgtcata 


t ctaagcccc 
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6540 


cttccttttg 


t tataggagc 


^ggagtgct a 
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acaacctcta 


ctcagttcta 
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gtcgccgact 


ccctggtcac 


cttgcaagat 


raarttasrf 
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cty u ct y lll l l 


67 9 0 


craaatcgaa 


gagctttaga 


ctygctaacc 


gctgaragag 


ggggaacctg 


tttattttta 


6780 


ggggaagaat 


gctgttatta 


tgttaatcaa 


tccggaatcg 
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gattctcccc 
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tataatattg 
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7140 


gaggaaatct 


cagctgcaca 


acctctacta 


cgccccaatt 


cagcaggaag 


cagttagagc 
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7260 


gacaggacta 
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ctaggctgay 


taagaatccy 


taagcctags 


tcraaaaaata 


7320 
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cggggcttgc 
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7380 
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7440 
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ccaagtyttc 


gagccggcaa 
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cttgcarctg 
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<210> 31 

<211> 363 

<212> PRT 

<213> Homo sapiens 

<400> 31 

Met Gly Asn lie Pro Pro Lys Ala Lys Thr Pro Leu Arg Cys lie Leu 
15 10 15 

Glu Asn Trp Asp Gin Cys Asp Thr Gin Thr Leu Arg Lys Lys Arg Phe 
20 25 30 

lie Phe Phe Cys Ser Thr Ala Trp Pro Gin Tyr Pro Leu Gin Gly Arg 
35 40 45 

Glu Thr Trp Leu Pro Glu Gly Ser lie Asn Tyr Asn lie lie Leu Gin 
50 55 60 

Leu Asp Leu Phe Cys Arg Lys Glu Gly Lys Trp Ser Glu Val Pro Tyr 
65 70 75 80 

Val Gin Thr Phe Phe Ser Leu Arg Asp Asn Ser Gin Leu Cys Lys Lys 
85 90 95 



35 



Cys Gly Leu Cys Pro Thr Gly Ser Pro Gin Ser Pro Pro Pro Tyr Pro 

100 105 110 



Ser Val Pro Pro Pro Thr Pro Ser Ser Thr Asn Lys Asp Pro Pro Leu 
115 120 125 



Thr Gin Thr Val Gin Lys Glu lie Asp Lys Gly Val Asn Asn Glu Pro 
130 135 140 



Lys Ser Ala Asn lie Pro Arg Leu Cys Pro Leu Gin Ala Val Arg Gly 
145 150 155 160 



Gly Glu Phe Gly Pro Ala Arg Val Pro Val Pro Phe Ser Leu Ser Asp 
165 170 175 



Leu Lys Gin lie Lys lie Asp Leu Gly Lys Phe Ser Asp Asn Pro Asp 
180 185 190 



Gly Tyr lie Asp Val Leu Gin Gly Leu Gly Gin Ser Phe Asp Leu Thr 
195 200 205 



Trp Arg Asp lie Met Leu Leu Leu Asn Gin Thr Leu Thr Pro Asn Glu 
210 215 220 



Arg Ser Ala Ala Val Thr Ala Ala Arg Glu Phe Gly Asp Leu Trp Tyr 
225 230 235 240 



Leu Ser Gin Val Asn Asn Arg Met Thr Thr Glu Glu Arg Thr Thr Pro 
245 250 255 



Thr Gly Gin Gin Ala Val Pro Ser Val Asp Pro His Trp Asp Thr Glu 
260 265 270 



Ser Glu His Gly Asp Trp Cys His Lys His Leu Leu Thr Cys Val Leu 
275 280 285 



Glu Gly Leu Arg Lys Thr Arg Lys Lys Pro Met Asn Tyr Ser Met Met 
290 295 300 



Ser Thr lie Thr Gin Gly Lys Glu Glu Asn Pro Thr Ala Phe Leu Asp 
305 310 315 320 



36 



Arg Leu Arg Glu Ala Leu Arg Lys His Thr Ser Leu Ser Pro Asp Ser 
325 330 335 



lie Glu Gly Gin Leu lie Leu Lys Asp Lys Phe lie Thr Gin Ser Ala 
340 345 350 



Ala Asp lie Arg Lys Asn Phe Lys Ser Leu Pro 
355 360 



<210> 32 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 32 

atccaaagtg gtgagtaata 20 

<210> 33 

<211> 20 

<212> DNA 

<213> Homo sapiens 



<400> 33 

cttttttcag atgggaaacg 20 



37 
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7 
10 
13 
19 
20 
26 
32 
■> 35 
41 
C344 
0350 

CO 53 

ft 59 
Sf65 
jS71 
SI 7 4 
' y 77 
"^80 
- 86 
088 
|B 92 
1^96 

sjioo 
r ao4 

g't 108 

;r- 112 

116 
120 
124 
128 
132 
136 
140 
144 
148 
152 
156 
160 
164 
168 
172 
176 
180 
184 
188 
194 
197 



Input Set : A:\ES.txt 

Output Set: N:\CRF3\12192001\I869927.raw 

<110> APPLICANT: PARANHOS - BACCALA , Glaucia 
MALLET , Francois 
VOISSET, Cecile 

<120> TITLE OF INVENTION: ENDOGENEOUS NUCLEIC ACID FRAGMENT ASSOCIATED WITH AN 

AUTOIMMUNE DISEASE/ LABELING METHOD AND REAGENT 
<130> FILE REFERENCE: 11004 8 

<140> CURRENT APPLICATION NUMBER: 09/869/927 
<141> CURRENT FILING DATE : 2001-10-22 

<150> PRIOR APPLICATION NUMBER: PCT/FR00/00144 

<151> PRIOR FILING DATE: 2000-01-21 

<150> PRIOR APPLICATION NUMBER: FR 99/00888 

<151> PRIOR FILING DATE: 1999-01-21 

<160> NUMBER OF SEQ ID NOS : 33 

<170> SOFTWARE: Patentln version 3.1 

<210> SEQ ID NO: 1 

<211> LENGTH: 1511 

<212> TYPE: DNA 

<213> ORGANISM: Homo sapiens 

<400> SEQUENCE: 1 



cctagaacgt attctggaga attgggacca atgtgacact cagacgctaa gaaagaaacg 60 

atttatattc ttctgcagta ccgcctggcc acaatatcct cttcaaggga gagaaacctg 120 

gcttcctgag ggaagtataa attataacat catcttacag ctagacctct tctgtagaaa 180 

ggagggcaaa tggagtgaag tgccatatgt gcaaactttc ttttcattaa gagacaactc 240 

acaattatgt aaaaagtgtg gtttatgccc tacaggaagc cctcagagtc cacctcccta 300 

ccccagcgtc ccctccccga ctccttcctc aactaataag gacccccctt taacccaaac 360 

ggtccaaaag gagatagaca aaggggtaaa caatgaacca aagagtgcca atattccccg 420 

attatgcccc ctccaagcag tgagaggagg agaattcggc ccagccagag tgcctgtacc 4 80 

tttttctctc tcagacttaa agcaaattaa aatagaccta ggtaaattct cagataaccc 54 0 

tgacggctat attgatgttt tacaagggtt aggacaatcc tttgatctga catggagaga 600 

tataatgtta ctactaaatc agacactaac cccaaatgag agaagtgccg ctgtaactgc 660 

agcccgagag tttggcgatc tttggtatct cagtcaggcc aacaatagga tgacaacaga 720 

ggaaagaaca actcccacag gccagcaggc agttcccagt gtagaccctc attgggacac 780 

agaatcagaa catggagatt ggtgccacaa acatttgcta acttgcgtgc tagaaggact 840 

gaggaaaact aggaagaagc ctatgaatta ctcaatgatg tccactataa cacagggaaa 900 

ggaagaaaat cttactgctt ttctggacag actaagggag gcattgagga agcatacctc 960 

cctgtcacct gactctattg aaggccaact aatcttaaag gataagttta tcactcagtc 1020 

agctgcagac attagaaaaa acttcaaaag tctgccttag gcccggagca gaacttagaa 1080 

accctattta acttggcatc ctcagttttt tataatagag atcaggagga gcaggcgaaa 114 0 

cgggacaaac gggataaaaa aaaaaggggg ggtccactac tttagtcatg gccctcaggc 1200 

aagcagactt tggaggctct gcaaaaggga aaagctgggc aaatcaaatg cctaataggg 1260 

ctggcttcca gtgcggtcta caaggacact ttaaaaaaga ttatccaagt agaaataagc 1320 

cgcccccttg tccatgcccc ttacgtcaag ggaatcactg gaaggcccac tgccccaggg 13 80 

gatgaagata ctctgagtca gaagccatta accagatgat ccagcagcag gactgagggt 144 0 

gcccggggcg agcgccagcc catgccatca ccctcacaga gccccgggta tgtttgacca 1500 

ttgagagcca a 1511 
<210> SEQ ID NO: 2 
<211> LENGTH: 2009 
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200 <212> TYPE: DNA 

203 <213> ORGANISM: Homo sapiens 

209 <220> FEATURE: 

212 <221> NAME/KEY: misc_feature 

215 <222> LOCATION: ( 1940 )..( 1940 ) 

218 <223> OTHER INFORMATION: n = a or g or c or t/u 
225 <400> SEQUENCE: 2 

227 atacgactac tatagggcga attgggccct ctagatgcat gctcgagcgg ccgccagtgt 60 

231 gatggatatc tgcagaattc gccctttgtc cgctgtgctc ctgatccagc gaggcgccca 120 

235 ttgctgctcc caattgggct aaaggcttgc cattgttccc acacggctaa gtgcccgggt 180 

239 tcatcctaat tgagctgaac actagtcact gggttccatg gttctcttcc atgacccacg 240 

243 gcttctaata gagctctaat actcaccaca tggcccaaga ttccattcct tggaatccgt 300 

247 gaggccaaga accccaggtc agagaacacg aggcttgcca ccgtcttgga agtggcccgc 360 

f^2 51 cgccatcttg ggagctctgg gagcaaggac cccccagtaa cattttggca accacaaagg 420 

.*'S2 55 gacctccaaa gcgatgggaa acattccccc caaggcaaaa acgcccctaa gatgtattct 4 80 

^J259 ggagaattgg gaccaatgtg acactcagac gctaagaaag aaacgattta tattcttctg 540 

2?263 cagtaccgcc tggccacaat atcctcttca agggagagaa acctggcttc ctgagggaag 600 

*t;267 tataaattat aacatcatct tacagctaga cctcttctgt agaaaggagg gcaaatggag 660 

%M271 tgaagtgcca tatgtgcaaa ctttcttttc attaagagac aactcacaat tatgtaaaaa 720 

y ; J275 gtgtggttta tgccctacag gaagccctca gagtccacct ccctacccca gcgtcccccc 780 

[0279 cccgactcct tcctcaacta ataaggaccc ccctttaacc caaacggtcc aaaaggagat 840 

\|283 agacaaaggg gtaaacaatg aaccaaagag tgccaatatt ccccgattat gccccctcca 900 

£ 287 agcagtgaga ggaggagaat tcggcccagc cagagtgcct gtaccttttt ctctctcaga 960 

P|291 cttaaagcaa attaaaatag acctaggtaa attctcagat aaccctgacg gctatattga 1020 

S295 tgttttacaa gggttaggac aatcctttga tctgacatgg agagatataa tgttactact 1080 

^299 aaatcagaca ctaaccccaa atgagagaag tgccgctgta actgcagccc gagagtttgg 1140 

f^3 03 cgatctttgg tatctcagtc aggtcaacaa taggatgaca acagaggaaa gaacaactcc 12 00 

2^307 cacaggccag caggcagttc ccagtgtaga ccctcattgg gacacagaat cagaacatgg 1260 

LJ311 agattggtgc cacaaacatt tgctaacttg cgtgctagaa ggactgagga aaactaggaa 1320 

M=315 gaagcctatg aattactcaa tgatgtccac tataacacag ggaaaggaag aaaatcctac 1380 

319 tgcttttctg gacagactaa gggaggcatt gaggaagcat acctccctgt cacctgactc 1440 

323 tattgaaggc caactaatct taaaggataa gtttatcact cagtcagctg cagacattag 1500 

327 aaaaaacttc aaaagtctgc cttaggcccg gagcagaact tagaaaccct atttaacttg 1560 

331 gcatcctcag ttttttataa tagagatcag gaggagcagg cgaaacggga caaacgggat 1620 

335 aaaaaaaaaa gggggggtcc actactttag tcatggccct caggcaagca gactttggag 1680 

339 gctctggaaa agggaaaagc tgggcaaatc aaatgcctaa tagggctggc ttccagtgcg 1740 

343 gtctacaagg acactttaaa aaagattatc caagtagaaa taagccgccc ccttgtccat 1800 

347 gccccttacg tcaagggaat cactggaagg cccactgccc caggggatga agatactctg 1860 

351 agtcagaagc cattaaccag atgatccagc agcaggactg agggtgcccg gggcgagcgc 1920 

W--> 3 55 cagcccatgc catcaccctn acagagcccc gggtatgctt gaccattgag agccaggagg 1980 

359 ttaactgtct cctggacact ggcgcagcc 2009 
365 <210> SEQ ID NO : 3 
368 <211> LENGTH: 1056 
371 <212> TYPE: DNA 
374 <213> ORGANISM: Homo sapiens 
380 <400> SEQUENCE: 3 

382 ctagaacgta ttctggagaa ttgggaccaa tgtgacactc agacgctaag aaagaaacga 60 

386 tttatattct tctgcagtac cgcctggcca caatatcctc ttcaagggag agaaacctgg 120 

390 cttcctgagg gaagtataaa ttataacatc atcttacagc tagacctctt ctgtagaaag 180 
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394 gagggcaaat ggagtgaagt gccatatgtg caaactttct tttcattaag agacaactca 240 
398 caattatgta aaaagtgtgg tttatgccct acaggaagcc ctcagagtcc acctccctac 300 
402 cccagcgtcc cctccccgac tccttcctca actaataagg accccccttt aacccaaacg 360 
406 gtccaaaagg agatagacaa aggggtaaac aatgaaccaa agagtgccaa tattccccga 420 
410 ttatgccccc tccaagcagt gagaggagga gaattcggcc cagccagagt gcctgtacct 480 
414 ttttctctct cagacttaaa gcaaattaaa atagacctag gtaaattctc agataaccct 540 
418 gacggctata ttgatgtttt acaagggtta ggacaatcct ttgatctgac atggagagat 600 
422 ataatgttac tactaaatca gacactaacc ccaaatgaga gaagtgccgc tgtaactgca 660 
426 gcccgagagt ttggcgatct ttggtatctc agtcaggcca acaataggat gacaacagag 720 
430 gaaagaacaa ctcccacagg ccagcaggca gttcccagtg tagaccctca ttgggacaca 780 
434 gaatcagaac atggagattg gtgccacaaa catttgctaa cttgcgtgct agaaggactg 840 
438 aggaaaacta ggaagaagcc tatgaattac tcaatgatgt ccactataac acagggaaag 900 
442 gaagaaaatc ttactgcttt tctggacaga ctaagggagg cattgaggaa gcatacctcc 960 

Q446 ctgtcacctg actctattga aggccaacta atcttaaagg ataagtttat cactcagtca 1020 

yQ450 gctgcagaca ttagaaaaaa cttcaaaagt ctgcct 1056 

Q3456 <210> SEQ ID NO : 4 

ff?459 <211> LENGTH: 25 

^462 <212> TYPE: DNA 

^465 <213> ORGANISM: Homo sapiens 

2^471 <400> SEQUENCE: 4 

J"^473 cggacatcca aagtgatggg aaacg 2 5 

^479 <210> SEQ ID NO: 5 

si 482 <211> LENGTH: 26 

0485 <212> TYPE: DNA 

fij488 < 213 > ORGANISM: Homo sapiens 

jL-s.4 94 <400> SEQUENCE: 5 

■ s j496 ggacaggaaa gtaagactga gaaggc 2 6 

,4502 <210> SEQ ID NO: 6 
; : f505 <211> LENGTH: 2 6 
^508 <212> TYPE: DNA 

511 <213> ORGANISM: Homo sapiens 

517 <400> SEQUENCE: 6 

519 cctagaacgt attctggaga attggg 2 6 

525 <210> SEQ ID NO: 7 

528 <211> LENGTH: 26 

531 <212> TYPE: DNA 

534 <213> ORGANISM: Homo sapiens 

540 <400> SEQUENCE: 7 

542 tggctctcaa tggtcaaaca tacccg 26 

548 <210> SEQ ID NO: 8 

551 <211> LENGTH: 21 

554 <212> TYPE: DNA 

557 <213> ORGANISM: Homo sapiens 

563 <400> SEQUENCE: 8 

565 atgggaaacg ttccccccga g 21 

571 <210> SEQ ID NO: 9 

574 <211> LENGTH: 21 

577 <212> TYPE: DNA 

580 <213> ORGANISM: Homo sapiens 
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PATENT APPLICATION 

IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 
In re the Application of 

Glaucia PARANHOS-BACCALA, Francois MALLET, 
Cecile VOISSET 

Application No.: US National Stage of PCT/FR00/00144 

Filed: July 9, 2001 Docket No.: 110048 

For: ENDOGENEOUS NUCLEIC ACID FRAGMENT ASSOCIATED WITH AN 
AUTOIMMUNE DISEASE, LABELING METHOD AND REAGENT 

PRELIMINARY AMENDMENT 

Director of the U.S. Patent and Trademark Office 
Washington, D. C. 20231 

Sir: 

Prior to initial examination, please amend the above-identified application as follows: 
IN THE CLAIMS : 

Please replace claims 7, 11, 16 and 17 as follows: 

7. (Amended) Transcription product which can be obtained by transcription of at 
least said portion of the gag gene of a fragment according to claim 1 . 

1 1 . (Amended) Method according to claim 8, characterized in that the biological 
sample is a biological fluid chosen from serum, plasma, synovial fluid and urine. 

1 6. (Amended) Reagent for detecting, in a biological sample, an autoimmune 
disease or monitoring pregnancy, comprising at least one fragment according to claim 1 . 

17. (Amended) Use of a fragment according to claim 1 , for detecting, in a 
biological sample, susceptibility to an autoimmune disease, or monitoring pregnancy. 



US National Stage of PCT/FR00/00144 



REMARKS 

Claims 1-18 are pending. Claims 7, 11, 16 and 17 are amended to eliminate multiple 
dependencies. Prompt and favorable consideration on the merits is respectfully requested. 

The attached Appendix includes marked-up copies of each rewritten claim (37 C.F.R. 
§1.121(c)(l)(ii)). 



Dectfully submitted, 



WPB:JSA/zmc 

Attached: APPENDIX 

Date: July 9, 2001 

Oliff & BERRIDGE, PLC 
P.O. Box 19928 
Alexandria, Virginia 22320 
Telephone: (703) 836-6400 
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APPENDIX 

Changes to Claims: 

The following are marked-up versions of the amended claims: 

7. (Amended) Transcription product which can be obtained by transcription of at 
least said portion of the gag gene of a fragment according to claim l any on e of claims 1 to 6 . 

1 1 . (Amended) Method according to claim 8 any one of claims 8 to 10 , 
characterized in that the biological sample is a biological fluid chosen from serum, plasma, 
synovial fluid and urine. 

16. (Amended) Reagent for detecting, in a biological sample, an autoimmune 
disease or monitoring pregnancy, comprising at least one fragment according to claim 1 .a ny 
on e of claims 1 to 6, one transcription/translation product, as obtained according to th e 
method of claim 11, one sythentic peptide belonging to SEQ ID NO. 31 or one protein 
according to claim 15 or 16. 

17. (Amended) U se of a fragment according to claim l any one of claims 1 to 6, of 
a transcription/translation product, as obtained according to the method of claim 1 1, one 
syth e ntic peptid e b e longing to SEQ ID NO. 3 1 or one prot e in according to claim 15 or 16 , for 
detecting, in a biological sample, susceptibility to an autoimmune disease, or monitoring 
pregnancy. 
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E NDOGENEOUS NUCLEIC ACID FRAGMENT M g^IMg^ w ^™^? 
AUTOI^^ 

The present invention relates to an endogenous nucleic 
acid fragment of the retroviral type, integrated into 
the DNA of the human genome. 

Retroviruses are RNA viruses which replicate through a 
process termed reverse transcription, mediated by an 
RNA-dependent DNA polymerase named reverse 

transcriptase (RT) , which is encoded by the pol gene. 
The retroviral RNA also comprises at least two 
additional genes, which are the gag and env genes. The 
gag gene encodes the proteins of the backbone, i.e. the 
matrix, the capsid and the nucleocapsid . The env gene 
encodes the envelope proteins. The transcription is 
regulated by promoter regions located in the LTRs (Long 
Terminal Repeat) which border the 5'- and 3' -terminal 
ends of the retroviral genome. 

In the course of evolution, humans or their ancestors 
have integrated material of retroviral origin into 
their genome subsequent to an infection. Specifically, 
when a cell is infected, the reverse transcriptase 
makes a DNA copy of the retroviral RNA, and this DNA 
copy may then possibly integrate into the human genome. 
Retroviruses can infect germinal cells and thus be 
transmitted to future generations by vertical MendelieTi 
transmission. They are then referred to as endogenous 
retroviruses which are present in the form of proviral 
DNA integrated into the genome of all human cells. Most 
endogenous retroviruses are silent or defective. 
However, some of them have been able to conserve all or 
part of their initial properties and may be activated 
under specific conditions. The expression of endogenous 
retroviruses can range from the transcription of viral 
genes to the production of viral particles. 
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These endogenous retroviruses may be associated 
directly or indirectly with the development of certain 
pathological conditions . 

5 Endogenous retroviral structures may be in a complete 
LTR-gag-pol-env-LTR form or in truncated forms. 

Thus, in a previous patent application 

(PCT/FR98/01442 ) , the applicant screened a cDNA library 

10 using a Ppol-MSRV probe (SEQ ID NO. 18) and detected 
overlapping clones which allowed it to reconstruct a 
putative genomic RNA of 7582 nucleotides. This genomic 
RNA has an R-U5-gag-pol-env-U3-R structure. A "blastn" 
interrogation over several databases using the 

15 reconstructed genome made it possible to show that 
there is a considerable amount of related genomic (DNA) 
sequences in the human genome, which are found on 
several chromosomes. Thus, the applicant demonstrated 
the existence of partial structures of the retroviral 

2 0 type in the human genome and envisaged their potential 
role in the development of autoimmune diseases, in 
unsuccessful pregnancy or pathological conditions of 
pregnancy. 

25 Autoimmune diseases which may be mentioned by way of 
example are multiple sclerosis, rhumatoid arthritis, 
lupus erythematosus disseminatus , insulin-dependent 
diabetes and/or pathologies which are associated with 
them . 

30 

The isolation and sequencing of overlapping cDNA 
fragments and the identification of genomic (DNA) 
clones corresponding to the isolated DNA clones, 
described in the applicant's abovementioned PCT patent 
35 application, are incorporated herein by way of 
reference . 

Isolation and sequencing of overlapping cDNA fragments: 
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The information regarding the organization of the novel 
family of endogenous retroviruses named, by the 
applicant, HERV-W was obtained by testing a placenta 
cDNA library (Clontech cat#HL5014a) with the Ppol-MSRV 
(SEQ ID NO. 18) and Penv-C15 (SEQ ID NO. 19) probes and 
then carrying out a "gene walking" technique using the 
novel sequences obtained. The experiments were carried 
out with reference to the recommendations of the 
supplier of the library. PCR amplifications on DNA were 
also used in order to understand this organization. 

The following clones were selected and sequenced: 

- Clone cl.6A2 (SEQ ID NO. 20): 5' untranslated region 
of HERV-W and a portion of gag. 

- Clone cl.6Al (SEQ ID NO. 21): gag and a portion of 
pol . 

- Clone cl.7A16 (SEQ ID NO. 22) : 3' region of pol. 

- Clone cl.Pi22 (SEQ ID NO. 23) : 3' region of pol and 
start of env. 

- Clone cl.24.4 (SEQ ID NO. 24): spliced RNA comprising 
a portion of the 5' untranslated region of HERV-W, the 
end of pol and the 5' region of env. 

- Clone cl.C4C5 (SEQ ID NO. 25): end of env and 3' 
untranslated region of HERV-W. 

- Clone cl.PH74 (SEQ ID NO. 26) : subgenomic RNA: 5' 
untranslated region of HERV-W, end of pol, env, and 3' 
untranslated region of HERV-W. 

- Clone cl.PH7 (SEQ ID NO. 27): multispliced RNA: 5' 
untranslated region of HERV-W, end of env and 3' 
untranslated region of HERV-W. 

- Clone cl-PiST (SEQ ID NO. 28): partial pol gene and 
U3-R region. 

- Clone cl.44.4 (SEQ ID NO. 29): R-U5 region, gag gene 
and partial pol gene. 
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A total sequence model for HERV-W was produced with the 
aid of these clones, by carrying out sequence 
alignments. The spliced RNAs were revealed and also the 
potential splice donor and acceptor sites. The LTR, 
5 gag, pol and env entities were defined by studying 
similarity with existing retroviruses. 



The putative genetic organization of HERV-W in the RNA 
form is as follows (SEQ ID NO. 30) : 

10 

gene 1 . - 7582 . 



Location of the clones on the reconstructed genomic RNA 
sequence : 
15 cl.6A2 (1321 bp) 1-1325; 

C1.PH74 (535+2229- 2764 bp) 72-606 and 5353-7582; 
cl.24.4 (491+1457= 1948 bp); 115-606 and 5353-6810; 
cl.44.4 (2372 bp) 115-2496; 

cl.PH7 (369+297= 666 bp) 237-606 and 7017-7313; cl . 6A1 
20 (2938 bp) 586-3559; 

cl.PiST (2785+566= 3351 bp) 2747-5557 and 7017-7582; 
C1.7A16 (1422 bp) 2908-4337; 

cl.Pi22 (317+1689= 2006 bp) 3957-4273 and 4476-6168; 
C1.C4C5 (1116 bp) 6467-7582 
25 5'LTR 1..120 

/note="R of 5'LTR (5' end uncertain [sic]" 

121. .575 

/note="U5 of 5'LTR" 

misc. 579. .596 

30 /note="PBS, primer binding site, for tRNA-W" 

misc. 606 

/note="splice junction (splice donor site 
ATCCAAAGTG-GTGAGTAATA and splice acceptor 
site CTTTTTTCAG-ATGGGAAACG, clone RG083M05, 
35 GenBank accession AC000064)" 

misc. 5353 

/note="splice acceptor site for ORF1 (env)" 

misc. 55 60 
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/note="splice donor site" 
ORF 5581. .7194 

/note="ORFl env 538 AA" 
/product -=" envelope" 
5 misc. 7017 

/note="splice acceptor site for ORF2 and 
ORF 3" 

ORF 7039. .7194 

/note="ORF2 52 AA" 
10 ORF 7112 .. 7255 

/note="ORF3 4 8 AA" 
misc. 7244. .7254 

/note="PPT, polypurine tract" 
3'LTR 7256. .7582 

15 /note="U3-R of 3' LTR (U3-R junction 

undetermined) 
misc. 7563. .7569 

polyadenylation signal 



20 Identification of genomic (DNA) clones corresponding to 
the isolated DNA clones: 



A "blastn" interrogation over several databases, using 
the reconstructed genome, showed that there is a 

25 considerable amount of related sequences in the human 
genome. Approximately 400 sequences were identified in 
GenBank and more than 200 sequences in the EST bank, 
most of them in the antisense orientation. The 4 most 
significant sequences in terms of size and similarity 

30 are the sequences of the following genomic (DNA) 
clones : 

Human clone RG083M05 (gb AC0000 64 ) , the chromosomal 
location of which is 7q21-7q22, 

Human clone BAC378 (gb U85196, gb AE000660) 

35 corresponding to the alpha/delta locus of the T-cell 
receptor, located at 14qll-12, 

Human cosmid Q11M15 (gb AF045450) corresponding to 
region 21q22.3 of chromosome 21, 
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Cosmid U134E6 (embl Z83850) on chromosome Xq22 . 

The location of the aligned regions for each of the 
clones is indicated and the chromosome to which they 
5 belong is indicated between square brackets (Figure 6 
[sic]). The percentage similarity (without the large 
deletions) between the 4 sequences and the 
reconstructed genomic RNA is indicated, and also the 
presence of repeat sequences at each end of the genome 

10 and the size of the longest open reading frames (ORFs) . 
Repeat sequences were found at the ends of 3 of these 
clones. The reconstructed sequence is entirely 
contained within clone RG083M05 (9.6 Kb) and exhibits 
96% similarity. However, clone RG083M05 has a 2 Kb 

15 insertion located immediately downstream of the 5' 
untranslated region (5' UTR) . This insertion is also 
found in two other genomic clones which have a 2 . 3 Kb 
deletion immediately upstream of the 3' untranslated 
region (3' UTR) . No clone contained the three 

20 functional gag, pol and env open reading frames (ORFs) . 
Clone RG083M05 shows a 538 amino acid (AA) ORF 
corresponding to a whole envelope. Cosmid Q11M15 
contains two major contiguous ORFs of 413 AA (frame 0) 
and 305 AA (frame +1) corresponding to a truncated pol 

25 polyprotein . 

An endogenous nucleic acid fragment has now been found 
and isolated, which is integrated into the DNA of the 
human genome and which comprises or consists of at 

30 least one portion of the gag gene of an endogenous 
retrovirus associated with an autoimmune disease, or 
with unsuccessful pregnancy or pathological conditions 
of pregnancy, this portion at least encoding, directly 
or indirectly, an expression product. Of course, the 

35 invention also comprises the sequence complementary to 
said fragment. 
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Advantageously, the fragment defined above also 
satisfies at least any one of the following 
characteristics : 

5 It comprises, or consists of, said whole gag gene; 

Said portion of the fragment at least encodes the 
matrix and the capsid; 

10 It comprises, or consists of, SEQ ID NO. 1, SEQ ID 
NO. 2, SEQ ID NO. 3 or the sequence complementary to 
any one of said sequences; 

It is located on at least one of human chromosomes 1, 
15 3, 6, 7 and 16, it is preferably located on at least 
chromosome 3; 

The product of expression of said portion is messenger 
RNA; 

20 

The product of expression of said portion is 
immunologically recognized by antibodies present in a 
biological sample from a patient suffering from an 
autoimmune disease, such as multiple sclerosis; 
2 5 preferably, the biological fluid is chosen from serum, 
plasma, synovial fluid and urine. 

Another subject of the invention is an endogenous 
transcription product which is in isolated form and 
30 which can be obtained by transcription of at least said 
portion of the gag gene of a fragment of the invention. 

The invention also relates to a method for detecting 
endogenous nucleotide sequences belonging to a fragment 
35 of the invention, comprising the following steps: 

a prior step of extraction of the cellular DNA from a 
tissue or biological fluid is carried out, and then at 
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least one cycle of amplification of the cellular DNA is 
carried out, for instance by PCR, using primers in 
particular chosen from SEQ ID NO. 4 to SEQ ID NO. 9 and 
SEQ ID NO. 12 to SEQ ID NO. 17 , 

5 

the cellular DNA present in the sample is brought into 
contact with a given probe which is capable of 
hybridizing with a fragment as defined above and of 
forming a hybridization complex, said probe comprising 
10 at least 15 contiguous nucleotides, preferably 17 and 
advantageously 19 contiguous nucleotides, of SEQ ID 
NO. 3, or consisting of SEQ ID NO. 3, under suitable 
conditions for the hybridization, in particular under 
conditions of high stringency, and 

15 

the hybridization complexes formed are detected by any 
suitable means . 

Advantageously, the probe is labeled with a tracer, 
20 such as for example a radioactive tracer or an enzyme. 

The invention also relates to a method for detecting 
endogenous nucleotide sequences belonging to a fragment 
of the invention, comprising the following steps: 

25 

a prior step of extraction of the cellular DNA from a 
tissue or biological fluid is carried out, and then at 
least one cycle of amplification of the cellular DNA is 
carried out, for instance by PCR, using primers in 
30 particular chosen from SEQ ID NO. 4 to SEQ ID NO. 9 and 
SEQ ID NO. 12 to SEQ ID NO. 17, 

a step of in vitro transcription/translation of the 
amplified product is carried out, and 

35 

the product derived from the transcription/translation 
step is reacted with a serum or plasma from a patient 
with an autoimmune disease. 
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The invention also relates to a method for studying 
and/or monitoring T-cell proliferation in vitro, 
according to which the T cells from a patient are 
5 brought into contact with either 

transcription/translation products (SEQ ID NO. 31), as 
obtained according to the method above, or synthetic 
peptides derived from or belonging to SEQ ID NO. 31. 

10 Another subject of the invention is a method for the in 
situ molecular labeling of chromosomes isolated from 
patients, in which a probe labeled with any suitable 
tracer, and comprising all or part of SEQ ID NO . 3 , is 
used. 

15 

The invention also relates to: 

a recombinant protein obtained using an expression 
cassette in a bacterial host, characterized in that its 
2 0 protein sequence consists of SEQ ID NO . 31 ; the 
bacterial host is in particular E. coli; 

a reagent for detecting an autoimmune disease or 
monitoring pregnancy, comprising at least one fragment 
25 or one protein of the invention; 

the use of a fragment or of a protein of the invention 
for detecting, in a biological sample, susceptibility 
to an autoimmune disease, or monitoring pregnancy; in 
30 particular, the autoimmune disease is multiple 
sclerosis . 

Before setting out the present invention in greater 
detail, the definition of certain terms employed in the 
35 description and claims is given. 

The expression "expression product" means any product 
derived from the retroviral DNA integrated into the 
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human genome, including the transcription products 
(messenger RNA) and the products derived from the 
translation of the messenger RNA obtained. In the 
latter case, and by way of example, the product may be 
5 a peptide or a protein which is functional or 
f unctionalizable, i.e. which can become functional. 

The expression "portion encoding, directly or 
indirectly, an expression product" is intended to mean 

10 a portion which, by itself, comprises at least all or 
part of an open reading frame from which it is possible 
to deduce an amino acid sequence, and the coding 
capacity of which can be induced by elements such as, 
for example, those which may have promoter activity. 

15 This definition includes the variability which may be 
found in the coding nucleic acid sequence, provided 
that the above conditions are respected. 

Example 1 : Location of the gag gene of the HERV-W 
2 0 family on human chromosomes using the Southern blot 
technique 

In order to locate the gag gene of the HERV-W family, a 
probe corresponding to this gene from MSRV was 

25 hybridized on a nylon membrane (Hybond® N+, Amersham) 
containing 5 \ig of DNA from 24 somatic cell hybrids 
[human x rodents] (isolated human genomic DNA: 22 
autosomal chromosomes and 2 sex chromosomes) and 3 
control DNAs (human, mouse and hamster) , digested with 

30 the EcoRI restriction enzyme. 

The following probe is used: Pgag-C12 identified by SEQ 
ID NO. 3 corresponding to the coding region (of 
1056 bp) of the clone MSRV gag C12 . 

35 

1.1- Production of clone 2, C12, containing, in the 3' 
region, a portion homologous to the pol gene, 
corresponding to the protease gene, and a portion 
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homologous to the gag gene, corresponding to the 
nucleocapsid, and a 5' coding region, corresponding to 
the gag gene, more specifically the matrix and capsid 
of MSRV-1. 

5 

A PCR amplification was carried out on total RNA 
extracted from 100 jil of plasma from a patient 
suffering from MS. A water control, treated under the 
same conditions, was used as a negative control. The 

10 cDNA synthesis was carried out with 300 pmol of a 
random primer (Gibco-BRL, France) and the "Expand RT" 
reverse transcriptase (Boehringer Mannheim, France) , 
according to the conditions recommended by the company. 
A PCR (polymerase chain reaction) amplification was 

15 carried out with the Taq polymerase enzyme (Perkin 
Elmer, France) using 10 \i± of cDNA under the following 
conditions: 94°C 2 min, 55°C 1 min and 72°C 2 min, then 
94°C 1 min, 55°C 1 min and 72°C 2 min for 30 cycles and 
72 °C for 7 min, with a final reaction volume of 50 jal . 

20 

The primers used for the PCR amplification are as 
follows : 

- 5' primer, identified by SEQ ID NO. 4 
2 5 5' CGG ACA TCC AAA GTG ATG GGA AAC G 3' ; 

- 3' primer, identified by SEQ ID NO. 5 
5' GGA CAG GAA AGT AAG ACT GAG AAG GC 3 ' 

A second "nested" PCR amplification was carried out 
30 with 5' and 3' primers located inside the region 
already amplified. This second PCR was carried out 
under the same experiment conditions as those used in 
the first PCR, using 10 \il of the amplification product 
derived from the first PCR. 

35 

The primers used for the nested PCR amplification are 
as follows : 




20 ligation in E. coli bacteria, the ligation mixture was 
plated out. At the end of the procedure, the white 
colonies of recombinant bacteria were picked in order 
to be cultured and to allow the extraction of the 
incorporated plasmids according to the "DNA 

25 minipreparation" procedure (J. Sambrook, E.F. Fritsch 
and T. Maniatis, Molecular Cloning, a laboratory 
manual, Cold Spring Harbour Laboratory Press, 1989) . 
The plasmid preparation from each recombinant colony 
was cleaved with the Eco RI restriction enzyme and 

30 analyzed on agarose gel. The plasmids possessing an 
insert which was detected under UV light after staining 
the gel with ethidium bromide were selected in order to 
sequence the insert after hybridization with a primer 
complementary to the T7 promoter present on the cloning 

35 plasmid from the TA Cloning Kit®. The reaction prior to 
the sequencing was then carried out according to the 
method recommended for using the "Prism® Ready Reaction 
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Amplitaq® FS, DyeDeoxy Terminator" sequencing kit 
(Applied Biosystems, ref. 402119) and the automatic 
sequencing was carried out on the Applied Biosystems 
373 A and 377 machines, according to the manufacturer's 
instructions . 

The clone obtained, named C12, makes it possible to 
define a 1511 bp region which has an open reading frame 
in the N-terminal region of 1089 bp (fragment 434-1521 
of SEQ ID NO. 2) encoding 363 amino acids (SEQ ID 
NO. 31) corresponding to the matrix and capsid regions 
of the gag gene. 

The nucleotide sequence of C12 is identified by SEQ ID 
NO. 1. It is represented in Figure 2 with the potential 
amino acid reading frames . 

1.2- Production of the MSRV gag cl2 probe 

The probe was obtained after PCR amplification, using 
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out and labeled with [a-P 32 ] using random primers 
(Gibco-BRL, France) in accordance with instructions of 
the "Ready-to-go DNA labeling" kit (Pharmacia Biotech) . 
The unincorporated nucleotides were removed with a G-50 
5 Quick Spin column (Boehringer, Mannheim) . 

1.3- Southern blot 

The hybridization conditions are as follows: 

10 

After prehybridization for 4 hours (in 5X SSC, IX 
Denhardt's, 0.1% SDS, 50% formamide, 20 mM Tris-HCl, 
pH = 7.5, and 0.1 mg/ml of herring sperm DNA), the 
nylon membrane containing the human chromosomes was 

15 hybridized (in 5X SSC, IX Denhardt's, 0.1% SDS, 50% 
formamide, 20 mM Tris-HCl, pH = 7.5, 0.1 mg/ml of 
herring sperm DNA and 5% dextran sulfate) for 18 hours 
at 42 °C with the 32 P-labeled 1056 bp gag cl2 DNA probe 
(SEQ ID NO. 3) . After hybridization, the membrane (The 

20 BIOS Monochromosomal Somatic Cell Hybrid blot, from 
Quantum Bioprobe) hybridized with the gag probe was 
washed twice in 2X SSC/0.2% SDS solution for 15 min at 
room temperature, and twice (in 0.2X SSC/0.2% SDS) for 
15 min at 45 °C. After washing, the membrane was exposed 

25 to the X-ray film at -80°C in the presence of an 
amplifying screen . 

The results are given in Table 1 hereinafter. 
30 In this table: 

m, which signifies mouse, and h, which signifies 
hamster, correspond to the recipient cells for the 
human chromosomal DNA. 

35 

The number indicated under each chromosome corresponds 
to the number of bands encountered. 

The total number of copies of the gag gene is 66. 
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Example 2: PCR amplification of the gag gene of the 
HERV-W family on each of the isolated human 
chromosomes; verification of the specificity of the 
amplifications by Southern blot; "in vitro" 
transcription/translation (PTT) test using the PCR 
products, in order to verify the coding capacity and 
discover which of the human chromosomes have open 
reading frames for the gag gene of the HERV-W family. 

2.1- PCR amplification 

In order to amplify the HERV-W gag gene, a PCR was 
carried out on each isolated human chromosome [ NIGMS 
human/rodent somatic cell hybrid panel #2. The human 
monochromosomal NIGMS somatic hybrid mapping panel #2, 
described by H.L. Drwinga et al. and B.L. Dubois et 
al., obtained from the Coriell Institute (Camden, NJ) ] 
with the Taq polymerase enzyme (Perkin Elmer, France) 
using: 40 pmol of each primer, 25 mM of each dNTP 
(Pharmacia), 2 . 5 mM of MgCl 2 , 2.5 U of Taq polymerase 
in the standard PCR buffer (Perkin Elmer) and 300 ng of 
isolated chromosome DNA, in a final volume of 100 ul . 
The PCR conditions for amplifying the gag region are as 
follows: 3 min at 94°C; then 1 min at 94°C, 1 min at 
55°C and 3 min at 72°C for 30 cycles, and 7 min at 
72°C. 

The primers used for the PCR amplification of the gag 
gene, from an ATG introduced into the HERV-W gag 
sequence on each isolated human chromosome are as 
follows : 

- 5' primer, identified by SEQ ID NO. 14 

5'-TTT GGT AAT ACG ACT CAC TAT AGG GCA GCC ACC ATG GGA 
AAC GTT CCC CCC GAG-3' . 

The primer contains the T7 RNA polymerase promoter 
sequence, a "spacer", the Kozak sequence (translation 
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initiation site in eukaryotes) and the 5' gag sequence 

starting from the HERV-W ATG . 

- 3' primer, identified by SEQ ID NO. 15 

5 ' -TTTTTTTTTTTTTTTTTTTCAGGCTGCGCCAGTGTCCAGGAGAC-3 ' . 



The primer contains a poly-A tail (in order to 
stabilize the transcription of the RNA, represented by 
18 T bases), a stop codon (represented by TCA) and the 
sequence of the MSRV-1 protease gene (G+E+A) . 

For the amplification of the HERV-W gag gene using 
oligonucleotides defined in the LTR and protease 
regions of HERV-W, with the Taq polymerase enzyme 
(Perkin Elmer, France), the PCR conditions were as 
follows: 3 min at 94°C; then 1 min at 94°C, 1 min at 
60°C and 2 min at 72°C f 35 cycles; followed by 7 min at 
72 °C, with 50 ng of each monochromosomal DNA. 

The primers used for the PCR amplification of the gag 
gene using the oligonucleotide defined in the HERV-W 
LTR sequence, on each isolated human chromosome, are as 
follows : 

- 5' primer, identified by SEQ ID NO. 16 
5' -TGTCCGCTGTGCTCCTGATC-3' 

- 3' primer, identified by SEQ ID NO. 17 

5 , _ TTTTTTTTTTTTTTTTTTTCAGGCT ^ ^ 



The primer contains a poly-A tail (in order to 
stabilize transcription of the RNA, represented by 18 T 
bases) , a stop codon (represented by TCA) and the 
sequence of the MSRV-1 G+E+A protease gene. 

The PCR amplifications were carried out in an MJ 
Research PTC200 Peltier Thermal cycler machine. The PCR 
products (10 )Lil of each PCR product) were analyzed in a 
gel of 1% agarose in IX TBE (Tris-HCl, borate, EDTA) . 
In order to verify the specificity of the amplification 
products, 3 jal of each PCR product were analyzed in 
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agarose gel and then transferred onto a nylon membrane 
(Hybond® -N + , Amersham) (Southern blot) using 0.4 N 
NaOH. The hybridization with the gag cl2 probe 
(1056 bp) (J. Sambrook et al . , 1989) was carried out 
5 under the following conditions: after prehybridization 
for 4 hours (in 5X SSC, IX Denhardt's, 0.1% SDS, 50% 
formamide, 20 mM Tris-HCl, pH - 7.5, and 0.1 mg/ml of 
herring sperm DNA) , the nylon membrane was hybridized 
(in 5X SSC, IX Denhardt's, 0.1% SDS, 50% formamide, 

10 20 mM Tris-HCl, pH - 7.5, 0.1 mg/ml of herring sperm 
DNA and 5% dextran sulfate), for 18 hours at 42°C with 
the 32 P-labeled gag DNA probe. The gag PCR products from 
each isolated human chromosome were washed once, for 
15 min at room temperature, in a solution of 2X SSC, 

15 0.2% SDS; twice, for 15 min each wash at 65°C, in a 
solution of 0.2X SSC, 0.1% SDS; twice, for 15 min each 
at 65°C, in a solution of 0 . IX SSC, 0.1% SDS; and 
twice, for 30 min each at room temperature, in a 
solution of 0.1X SSC, 0.1% SDS. 

20 

Part of the remaining volume (4 of the PCR 

amplification products was used for the PTT "in vitro" 
transcription/translation test (Roest PAM et al . , 1993) 
(Promega, France) . The remaining volume was used for 

25 the cloning in the pCR® 2.1-TOPO vector (Invitrogen) in 
accordance with the instructions with the kit, and for 
the sequencing with the method recommended for using 
the "PRISM™ Ready Reaction Amplitaq® FS, DyeDeoxy™ 
Terminator" sequencing kit (Applied Biosystems, 

30 ref. 402119), and the automatic sequencing was carried 
out on Applied Biosystems 373A and 377 machines, 
according to the manufacturer's instructions. 

The portion encoded (SEQ ID NO. 31) by the 2009 bp 
35 fragment (SEQ ID NO. 2) was amplified by PCR with the 
Pwo enzyme (5 U/jj,1) (Boehringer Manneim, France) using 
1 (il of the minipreparation of the gag clone DNA (SEQ 
ID NO. 3) under the following conditions: 95°C 1 min, 
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60°C 1 min and 72°C 2 min for 25 cycles, with a final 
reaction volume of 50 |ul, using the primers: 
- 5' primer {Bam HI) (SEQ ID NO. 8) : 
5' ATG GGA AAC GTT CCC CCC GAG 3' (21 mer) , and 
5 -3' primer (Hind III), identified by SEQ ID NO. 9 
5 [sic] GGC CTA AGG CAG ACT TTT GAA 3' (21 mer) . 

The fragment obtained after PCR was linearized with 
Bam HI and Hind III and subcloned into the pET28C and 

10 pET21C expression vectors (NOVAGEN) linearized with 
Bam HI and Hind III. The DNA of the 1089 bp fragment in 
the two expression vectors were sequenced according to 
the method recommended for using the "PRISM™ Ready 
Reaction Amplitaq® FS, DyeDeoxy™ Terminator" 

15 sequencing kit (Applied Biosystems, ref. 402119) and 
the automatic sequencing was carried out on Applied 
Biosystems 373A and 377 machines, according to the 
manufacturer' s instructions . 

20 The expression of the nucleotide sequence of the 
1089 bp fragment of the gag clone by the pET28C and 
pET21C expression vectors is identified by SEQ ID 
NO. 10 and SEQ ID NO. 11, respectively. 

25 2.2- "In vitro" transcription/ translation test 
(PTT, Promega) 

This test was carried out in order to pinpoint the 
human chromosomes which have open reading frames for 
30 the gag gene of the HERV-W family. 

A mixture containing 12.5 jil of TNT® rabbit 
reticulocyte lysate (Promega) , 1 (il of TNT® reaction 
buffer (Promega), 0 . 5 (il of TNT® RNA polymerase 
35 (Promega), 0 . 5 (il of a 1 mM mixture of amino acids 
minus methionine, 2 |ul of 35 S-methionine (1000 Ci/mmol) 
at 10 mCi/|al (Amersham) , 0.5 \il of RNasin® ribonuclease 
inhibitor at 40 U/jal, 4 |al of PCR amplification 
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products (equivalent to 1 jag) from each human 
chromosome and 4 p.1 of water, in a reaction volume of 
25 fil, [lacuna] . This mixture was incubated at 30°C for 
90 min. 

5 

The gag proteins corresponding to the products of 
transcription/translation of the gag gene of the HERV-W 
family from each human chromosome, amplified by PCR, 
were revealed by 10% polyacrylamide gel electrophoresis 
10 in the presence of sodium dodecyl sulfate (SDS)-PAGE 
after exposure of the gel to the X-ray film at room 
temperature in the presence of an amplifying screen. 

The results are given in Table 2 hereinafter. 

15 

In this table, the number indicated under each 
chromosome corresponds to the molecular mass (kDa) of 
the proteins visualized in polyacrylamide gel in the 
presence of SDS. 
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Example 3: Expression of the gag clone in Escherichia 
coll, and reaction with human sera 

The coding region SEQ ID NO . 2 was expressed in 
5 Escherichia coli, and then the products thus expressed 
were tested against serum from patients suffering from 
MS, and also against serum from healthy patients. 

The constructs pET28c-gag clone (1089 bp) and pET21C- 
10 gag clone (1089 bp) synthesize, in the BL21 (DE3) 
bacterial strain, an N-terminal and C-terminal fusion 
protein for the pET28C vector, and a C-terminal fusion 
protein for the pET21C vector, with 6 histidine 
residues and an apparent molecular mass of 
15 approximately 45 kDa, which are revealed by SDS-PAGE 
polyacrylamide gel electrophoresis (U.K. Laemmli, 
Cleavage of structural proteins during the assembly of 
the head of bacteriophage T4, Nature, 1970, 227: 680- 
685) . 

20 

The reactivity of the protein with respect to an anti- 
histidine monoclonal antibody (DIANOVA) was 

demonstrated using the Western blot technique 
(H. Towbin et al . , Electrophoretic transfer of proteins 
25 from polyacrylamide gels to nitrocellulose sheets: 
procedure and some applications, Proc. Natl. Acad. Sci. 
USA, 1979, 76: 4350-4354). 

The recombinant proteins pET28C-gag clone (1089 bp) and 
30 pET21C-gag clone (1089 bp) were visualized, by SDS- 
PAGE, in the insoluble fraction after enzymatic 
digestion of the bacterial extracts with 50 (il of 
lysozyme (10 mg/ml) and lysis by ultrasound. 

35 The antigenic properties of the recombinant antigens 
pET28C-gag clone (1089 bp) and pET21C-gag clone 
(1089 bp) were tested by Western blot after 
solubilization of the bacterial pellet with 2% SDS and 
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50 mM p-mercaptoethanol . After incubation with the sera 
from patients suffering from multiple sclerosis, the 
sera from the neurological controls and the blood 
transfusion center (BTC) control sera, the 
5 immunocomplexes were detected using an alkaline 
phosphatase-coupled anti-human IgG and IgM goat serum. 

The results are given in Table 3 hereinafter. 

10 Table 3 



Reactivity of sera 


from patients 


suffering from 


multiple sclerosis and 


controls, with 


the recombinant 


gag protein produced in 


E. coli a 




DISEASE 


NUMBER OF 


NUMBER OF 




INDIVIDUALS 


POSITIVE 




TESTED 


INDIVIDUALS 


MS 


15 


6 






2 (+++) , 2 (++) , 






2 ( + ) 


NEUROLOGICAL CONTROLS 


2 


1 (+ + +) 


HEALTHY CONTROLS (BTC) 


22 


1 ( + /-) 



(a) The strips containing 1 . 5 \ig of recombinant gag 
antigen show reactivity against sera diluted 100-fold. 
The Western blot interpretation is based on the 
20 presence or absence of a gag-specific band on the 
strips. Positive and negative controls are included in 
each experiment. 

These results show that, under the technical conditions 
25 used, approximately 40% of the human multiple sclerosis 
sera tested react with the recombinant gag protein. 
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CLAIMS 

Nucleic acid fragment, characterized in that it 
consists of at least one portion of the gag gene 
of an endogenous retrovirus associated with an 
autoimmune disease, or with unsuccessful pregnancy 
or pathological conditions of pregnancy, said 
portion being chosen from SEQ ID NO. 2 and any 
series of contiguous nucleotides belonging to SEQ 
ID NO. 2 but not belonging to SEQ ID NO. 1 and 
encoding an expression product, or the sequence 
complementary to said fragment. 

Fragment according to claim 1, characterized in 
that it can be isolated from at least one of human 
chromosomes 1, 3, 6, 7 and 16. 

Fragment according to claim 2, characterized in 
that it can be isolated from at least chromosome 
3. 

Fragment according to claim 1, characterized in 
that the expression product is messenger RNA. 

Fragment according to claim 1, characterized in 
that the expression product is immunologically 
recognized by antibodies present in a biological 
sample from a patient suffering from an autoimmune 
disease . 

Fragment according to claim 5, characterized in 
that the autoimmune disease is multiple sclerosis. 

Transcription product which can be obtained by 
transcription of at least said portion of the gag 
gene of a fragment according to any one of claims 
1 to 6. 
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Method for detecting, in a biological sample, 
nucleotide sequences which are integrated into the 
DNA of the human genome and which belong to the 
gag gene of an endogenous retrovirus associated 
with an autoimmune disease, or with unsuccessful 
pregnancy or pathological conditions of pregnancy, 
characterized in that: 

- a prior step of extraction of the cellular DNA 
of said biological sample is carried out, and then 
at least one cycle of amplification of the 
cellular DNA is carried out, for instance by PCR, 
using primers in particular chosen from SEQ ID 
NO. 4 to SEQ ID NO. 9 and SEQ ID NO. 12 to SEQ ID 
NO. 17, 

- the cellular DNA present in the sample is 
brought into contact with a given probe which is 
capable of hybridizing with a said nucleotide 
sequence and of forming a hybridization complex, 
said probe comprising at least 15 contiguous 
nucleotides, preferably 17 and advantageously 19 
contiguous nucleotides, of SEQ ID NO. 3, or 
consisting of SEQ ID NO. 3, under suitable 
conditions for the hybridization, in particular 
under conditions of high stringency, and 

- the hybridization complexes formed are detected 
by any suitable means. 

Method according to claim 8, characterized in that 
the probe is labeled with a tracer, such as for 
example a radioactive tracer or an enzyme. 

Method for detecting, in a biological sample, 
nucleotide sequences which are integrated into the 
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DNA of the human genome and which belong to the 
gag gene of an endogenous retrovirus associated 
with an autoimmune disease, or with unsuccessful 
pregnancy or pathological conditions of pregnancy, 
characterized in that: 

- a prior step of extraction of the cellular DNA 
of said biological sample, optionally derived from 
isolated chromosomes, is carried out, and then at 
least one cycle of amplification of the cellular 
DNA is carried out, for instance by PCR, using 
primers in particular chosen from SEQ ID NO. 4 to 
SEQ ID NO. 9 and SEQ ID NO. 12 to SEQ ID NO. 17, 

- a step of in vitro transcription/translation of 
the amplified product is carried out, and 

- the product derived from the 
transcription/ translation step is reacted with a 
serum or plasma from a patient with an autoimmune 
disease . 

Method according to any one of claims 8 to 10, 
characterised in that the biological sample is a 
biological fluid chosen from serum, plasma, 
synovial fluid and urine. 

Method for studying and/or monitoring T-cell 
proliferation in vitro, according to which the T 
cells from a patient are brought into contact with 
either transcription/ translation products, as 
obtained according to the method of claim 11, or 
synthetic peptides belonging to SEQ ID NO. 31. 

Method for the in situ molecular labeling of 
chromosomes isolated from patients, in .which a 
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probe labeled with any suitable tracer, and 
comprising all or part of SEQ ID NO. 3, is used. 

Recombinant protein obtained using an expression 
cassette in a bacterial host, characterized in 
that its protein sequence consists of SEQ ID 
NO. 31. 

Protein according to claim 14, characterized in 
that the bacterial host is E . coli. 

Reagent for detecting, in a biological sample, an 
autoimmune disease or monitoring pregnancy, 
comprising at least one fragment according to any 
one of claims 1 to 6, one 

transcription/translation product, as obtained 
according to the method of claim 11, one synthetic 
peptide belonging to SEQ ID NO. 31 or one protein 
according to claim 15 or 16. 

Use of a fragment according to any one of claims 1 
to 6, of a transcription/translation product, as 
obtained according to the method of claim 11, of a 
synthetic peptide belonging to SEQ ID NO. 31 or of 
a protein according to claim 15 or 16, for 
detecting, in a biological sample, susceptibility 
to an autoimmune disease, or monitoring pregnancy. 

Use according to claim 17, characterized in that 
the autoimmune disease is multiple sclerosis. 
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10 20 30 40 SO 

123_4 56-7890 1234 567690 1234567890 1234567890 1234567890 

GCTAGAACGT ATTC7T33AGA ATTQQGACCA ATTTK^CACT CAG?£XXTAA 50 
PRTY £ G E LGP M HS DAK 
L E F TLEN WDQ CDT Q T L R 
. NV FWR IGTN V T L R R 

GAA^G^AACG ATTTATA2TC TICTCX2GTA ACAATA7CCT ICO 

KET I Y I L LQY KLA TISS 
KKR FIF FCST AWP QYP 
ERND L Y S SAV PPGH NIL 

CTICAA333A G?ffiAAOCTG GCTTOTGSG GGAA^TATAA A2TATAACAT 150 

SRE R N L AS.G KYK L.H 
LQGR ETW LPE GSIN Y N I 
FKG EKPG FLR E V . I I T S 

CATCTTACAS CTOSADC'ICr TCTTGTAGAAA C33AG33CAAA TO^TTCAAG 200 
HLTA R P L L . K G G Q M E.S 
ILQ Lr D L F CRK EGK WSEV 
SYS . T S SVER RAN GVK 

TG02A2A3X?T QCAAACITCC TTITCATIAA G?GACAACIC ACAATTAIGT 250 
AIC ANFL F I K R Q L TIM. 
PYV QTF FSLR DNS Q Lr C 
CHMC KLS F H ETTH NYV 

AAAAAGH3IG GTTMG3X TfsCfiGGAAQC CCTCftGftGIC CACCTOCCIA 300 

KVW FMP YRKP SES TSL 
KKCG LCP TGS PQSP PPY 
KSV VYAL QEA LRV HLPT 

odxmcctc crcraxnsA ciochtioctc aaci^a32^3 gacooocctt 350 

PQRP LPD SFLN..G PPF 
P S V PSPT PSS TNK DPPL 
PAS PPR LLPQ L I R TPL 

•EWCCEAAAC <rnOCAAA&3 G?*3A2&G?£A Aft3333IBAA CAATGAAXA 400 
NPN GPKG DRQ RGK Q.TK 

TQT VQK EIDK GVN NEP 
. PKR SKR R , T KG.T MNQ 

AAGAGrOGOCA AM'IUIO? ATEA2G3XC CIOCAAGCAG *n3¥»03&33 450 

ECO YSP JMPP PSS ERR 
KSAN IPR LCP LQAV RGG 
RVP IFPD YAP S K Q E E E 

AGAATTO33T TXXCICTAX 'ITmUlL'lC TCfCACmA 500 

RIRP SQS ACT FFSL RLK 
EFG PARV PVP FSL SDLK 
NSA QPE CLYL FLS Q T . 
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FlQ Z (continued) 



10 20 it. 40 50 

1234567890 1234567890 1234567890 1234S67890 1234567890 

AGCAAATTAA AAT^GACCTA CCTAAATK7T CAGATAAOX TGftCGXTAT 550 
AN N R P R . I L R P RLY 

Q I K I D L C K F <> D N P D G Y 
S K L K T VMS 0 I T L T A I 

AT7GATCTTT TACAAG33TT A3SACAATOC TTTGATCTGA CA1X3G$C&GA 600 

. CF THV R T I L . S D MER 
IDVL Q G L GQG F D L T WRD 
L M F YKG. DNP LI. HGEI 

TATAATGTTA CEACTAAATC AGACftCI^AC QXAAATGAG ^3^C?I0C03 650 
YNVT TKS DTN PK.E KCR 
IML LLN.O T L T PNE R S A A 
. CY Y.I RH.P QMR EVP 

CIUTAACK3C AG333G&3AG TTTOGOCOTC TTTQCTTATCT 700 
CNC SPRV W R S LVS QSGQ 
VTA ARE FGDL WYL SQA 
L.LQ PES LAI FGIS VRP 

AACAA2EGGA TCACAACAGA QSAA£GAACA PCVOX2CPG 750 

Q . D DNR GKNN SHR PAG 
NNRM TTE ERT TPTG Q Q A 
TIG .QQRKEQLPQASRQ 

A7TTCCCACT CTZPGAOOTK: ATIX33S^CAC ASAATCT^AA mrQSftGBTT 800 
SSQC RPS LGH RIRT WRL 
VPS VDPH WDT ESE HGDW 
FPV . TLIGTQNQNMEI 

QSTOOCaCAA A2aiT2X32I5V A L ' l ' lOLg lGC TEGAAGGftCT C3=iGgtfWCT 850 
VPQ TFAN LRA RRT EEN. 
CHK HLL TCVL E G L RKT 
G A T N I C . LAC K D GKL 

CTAIGAATTA CICAA0T5VTG TTOVTIS^TAA OTCK333&AA 900 
E E A Y E L LNDV HYN TGK 
RKKP MNY SMM STIT QGK 
GRS L.ITQ.C PL. HRER 

QG&AGAAAftT CTT^CTG^TT TIClUGfiCPG GCRTIGMGA 950 

GRKS YCF SGO TKGG I E E 
EEN LTAF LDR LRE ALRK 
KKI LLL FWTD . G R H.G 

A3CATAOCIC GBCICTATTC AAG3XAACT AASCTEAAAG 1000 

AYL PVT. L Y RPT NLKG 

HTS LSP DSIE GQL ILK 
SIPP CHLTLL KAN. S.R 
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10 20 30 40 50 

1234567890 1234S67890 1234567890 1234567890 1234567890 

GATAAGTTTA TOCTCAGIC AGCTOCAG^C ATTAGAAAAA ACTICAAAAG 1050 

. VV HSV SCHH . K K L Q K 
OKFI T Q S A A D IRKN F K S 
I S L S L S Q LOT L E K T S K V 

TCIQCCTTAg' G33333AGCA GftACTrAG'iA ACTX'JJlTTTlA ACTIOXATC 1100 
S A L G PEQ N L £ TLFN LAS 
LP. A R S R T . K PYL TWHP 
CLK PGA ELRN PI. LGI 

OOQTTTT TATAATAGAG ATCAG3AGCIA GGAGGZC^AA a333?CAAAC 1150 
SVF YNRD Q E E QAK RDKK 
QFF I I E I R R S RRN GTN 
LSFL . R SGG AGET GQT 

Q3GATAAAAA AAAAAQQ33G QTTOCACTAC TTTAGTCAIG GOXTCAGGT I2CX) 

DKK KRG GPLL . S W PSG 
GTKK KGG VKY FSHG PQA 
G.X KKGG STT LVM ALRQ 

AAQC2GAC2T TGGAGGCTCT QCAAAA033A AAAG2K333C AAATCAAATG 1250 
KQTL EAL QKG KAGQ IKC 
SRL WRLC KRE KLG KSNA 
A D F GGS AKGK SWA NQM 

CCTAATAG33 CIQQCTIOCA GHTGOSGICTA CAAGoACACT TTAAAAAAGA 1300 
LIG LASS AVY KDT LKKI 
- . G WLP VRST RTL K R 

PNRA GFQ CGL QGHF KKD 

TOmxAAGT pgz&ps&xx: uuuuu xt ig mroxa: teaotcaag 1350 

IQV EIS RPLV HAP Y V K 
LSK. K.A APL SMPL TSR 
Y P S RNKP PPC PCP LRQG 

GGAA2CACIG G^AGXCCftC TaXEEA03S fflUGAAG&m CTCIGACTCA 1400 
GITG RPT APG DEDT LSQ 
ESL EGPL PQG MKI L.VR 
NHW KAH CPRG R Y SES 

GAAQOCftTm AOTMffiT CT3iGCAGCAG GACIGH33GT UJUULUJJGG 1450 
KPL TR.S SSR TEG ARGE 
SH. PDD PAAG LRV PGA 
EAIN Q M I Q Q Q D.GC PGR 

AG33XKX3C OflOOMCA GDCTCACASV (XrXD332rm TGrTITCAOCA 1500 

RQP MPS PSOS PGY V.P 
SASP CHH PHR APGM FDH 
A P A HAIT LTE PRV CLTI 
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DECLARATION AND POWER OF ATTORNEY 
UNDER 35 USC §371(c)(4) FOR 
PCT APPLICATION FOR UNITED STATES PATENT 

As a below named inventor, I hereby declare that: 

my residence, post office address and citizenship are as stated below under my name; 

I verily believe I am the original, first and sole inventor (if only one name is listed below) or an 
original, first and joint inventor (if plural names are listed below) of the subject matter which is claimed and 
for which a patent is sought, namely the invention entitled: Endogeneous nucleic acid fragment 
associated with an auto-itnmune disease >_ labeling method and reagent . 



described and claimed in international application number PCT FR00/00144 filed 21th January 2000 . 

I have reviewed and understand the contents of the above-identified specification, including the 
claims, as amended by any amendment referred to above. 

I acknowledge the duty to disclose to the Office all information known to me to be material to 
patentability as defined in Title 37, Code of Federal Regulations §L56. 

Under Title 35, U.S. Code §119, the priority benefits of the following foreign application(s) filed by 
me or my legal representatives or assigns within one year prior to my international application are hereby 
claimed: 

France N° 99/00888 filed on 21th January 1999 

The following application(s) for patent or inventor's certificate on this invention were filed in 
countries foreign to the United States of America either (a) more than one year prior to my international 
application, or (b) before the filing date of the above-named foreign priority application(s): 

I hereby appoint the following as my attorneys of record with full power of substitution and revocation 
to prosecute this application and to transact all business in the Patent Office: 



James A, Oiiff, Reg* No* 27,075; William P. Berridge, Reg, No. 3C^024^ 
Kirk M. Hudson, Reg. No. 2Z«Sfi2; Thomas J. Fardini, Reg* No. 30yt44f. 
Edward P. Walker, Reg. No. 31,450; Robert A. Miller, Reg. No. 32,221; 
Mario A. Costantino, Reg. No. 33^65; Caroline D. Dcnnison, Reg. No. 24494; and 
Stephen J. Roe, Reg. No ._ 34,463. 

ALL CORRESPONDENCE IN CONNECTION WITH THIS APPLICATION SHOULD BE SENT TO QUFF 
& BERRIDGE, PLC, RjQJBOXi992£, ALEXANDRIA, mGINIA-22320, TELEPHONE (703) 836-6400. 

I hereby declare that I have reviewed and understand the contents of this Declaration, and that all 
statements made herein of my own knowledge are true and that all statements made on information and belief 
are believed to be true; and further that these statements were made with the knowledge that willful false 
statements and the like so made are punishable by fine or imprisonment, or both, under Section 1001 of Title 
18 of the United States Code and that such willful false statements may jeopardize the validity of the 
application or any patent issued thereon. 
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SEQUENCE LISTING 
<110> BIO MERIEUX 

<120> Endogenous nucleic acid fragment associated with an 
autoimmune disease, labeling method and reagent 

<130> SEP19-endogenous 

<140> 
<141> 

<150> FR 9900888 
<151> 1999-01-21 

<160> 31 

<170> Patentln Ver. 2.1 

<210> 1 
<211> 1511 
<212> DKA 

<213> Homo sapiens 

cctSgaacgt attctggaga attgggacca atgtgacact. cagacgctaa gaaagaaacg 60 
atttatattc ttctgcagta ccgcctggcc acaatatcct cttcaaggga 9agaaacctg 120 
gcttcctgag goaagtataa attataacat catcttacag ctagacctct tctgtagaaa 180 
ggagggcaaa tggagtgaag tgccatatgt gcaaactttc tcttcattaa gagacaactc 240 
lcaa?catgt aaaaagtgtg gtttatgccc tacaggaagc cctcagagtc "cctcccta 300 
ccccagcgtc ccctccccga ctccttcctc aactaataag gacccccctt taacccaaac 360 
ggcccaaaag gagatagaca aaggggtaaa caatgaacca aagagtgcca atattccccg 420 
ISatgcccJ ctccaalcag cgagaggagg agaattcggc ccagccagag tgcctgtacc 480 
tttttccctc tcagacttaa agcaaattaa aacagaccta ggtaaattct cagataaccc 540 
tgacggctat attgatgttt nacaagggtt aggacaatcc tttgatctga "tggagaga 600 
tataatgtta ctactaaatc agacactaac cccaaatgag agaagtgccg ctgcaactgc 660 
agcccgagag tttggcgatc tttggtatct cagtcaggcc aacaatagga ^caacaga 720 
ggaaagaaca actcccacag gccagcaggc agttcccagt gtagaccctc f ^gggacac 780 
agaatcagaa catggagatt ggtgccacaa acatttgcta acttgcgtgc tagaaggact 840 
gaggaaaact aggaagaagc ctatgaatta ctcaatgatg tccactataa cacagggaaa 900 
ggaagaaaat ettactgctt ttctggacag actaagggag gcattgagga agcatacctc 960 
cctgtcacct gactctattg aaggccaact aatcttaaag gataagttta tcactcagtc 1020 
agctgcagac actagaaaaa acttcaaaag tctgccttag gcccggagca gaacttagaa 1080 
accctattta acttggcacc ctcagttttt tataatagag atcaggagga gcaggcgaaa 114 0 
cgggacaaac gggataaaaa aaaaaggggg ggtccactac tctagtcatg gccctcaggc 1200 
aagcagactt cggaggctct gcaaaaggga aaagctgggc aaatcaaatg cctaataggg 1260 
ctggcttcca gtgcggtcta caaggacact ttaaaaaaga ttatccaagt agaaataagc 1320 
cgcccccttg tccatgcccc ttacgtcaag ggaatcactg gaaggcccac tgccccaggg 1380 
gatgaagata ctccgagtca gaagccatta accagatgat ccagcagcag gactgagggt 144 0 
gcccggggcg agcgccagcc catgccatca ccctcacaga gccccgggta tgtttgacca 1S00 
t tgagagcca a 



<210> 2 
<211> 2009 
<212> DNA 

<213> Homo sapiens 
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<400> 2 

atacgactac tatagggcga attgggccct 
gatggatatc tgcagaattc gccccttgtc 
ttgctgctcc caattgggct aaaggcttgc 
tcatcctaat tgagctgaac actagtcact 
gcttctaata gagccctaat actcaccaca 
gaggccaaga accccaggtc agagaacacg 
cgccatcttg ggagctccgg gagcaaggac 
gacctccaaa gcgatgggaa acattccccc 
ggagaattgg gaccaatgtg acactcagac 
cagtaccgcc tggccacaat atcctcttca 
tataaattat aacatcatct tacagctaga 
tgaagtgcca tatgtgcaaa ctttcttttc 
gtgtggttta cgccctacag gaagccctca 
cccgactcct tcctcaacta ataaggaccc 
agacaaaggg gtaaacaatg aaccaaagag 
agcagtgaga ggaggagaat tcggcccagc 
cttaaagcaa attaaaatag acctaggtaa 
tgttttacaa gggttaggac aatcctttga 
aaatcagaca ctaaccccaa atgagagaag 
cgatctttgg tatctcagtc aggtcaacaa 
cacaggccag caggcagntc ccagtgtaga 
agattggtgc cacaaacatt tgctaacttg 
gaagcctatg aattactcaa tgatgtccac 
tgcttttctg gacagactaa gggaggcatt 
tattgaaggc caactaatct taaaggataa 
aaaaaacttc aaaagcctgc cttaggcccg 
gcatcctcag ttttttataa tagagatcag 
aaaaaaaaaa gggggggtcc actactttag 
gct'ctggaaa agggaaaagc tgggcaaatc 
gtctacaagg acactttaaa aaagattatc 
gccccttacg tcaagggaat cactggaagg 
agtcagaagc cattaaccag atgatccagc 
cagcccatgc catcaccctn acagagcccc 
ttaactgtct cctggacact ggcgcagcc 



ctagatgcat gctcgagcgg ccgccagtgt 60 
cgctgtgccc ctgatccagc gaggcgccca 120 
catcgttccc acacggctaa gtgcccgggt 180 
gggtticcatg gttctcttcc atgacccacg 240 
tggcccaaga ttccattcct tggaatccgt 300 
aggcttgcca ccgtcttgga agtggcccgc 360 
cccccagtaa catttcggca accacaaagg 42C 
caaggcaaaa acgcccctaa gatgtattct 480 
gctaagaaag aaacgattta tattcttctg 54 0 
agggagagaa acctggcttc ctgagggaag 600 
cctcttcrgt agaaaggagg gcaaatggag 660 
attaagagac aactcacaat tatgtaaaaa 720 
gagtccacct ccctacccca gcgtcccccc 780 
ccctttaacc caaacggtcc aaaaggagat 340 
tgccaatatt ccccgatcat gccccctcca 500 
cagagtgcct gtaccttttt ctctctcaga 960 
attctcagat aaccctgacg gctatattga 1020 
tctgacatgg agagatataa tgttactact 1080 
cgccgccgta accgcagccc gagagtttgg 1140 
taggatgaca acagaggaaa gaacaactcc 12 00 
ccctcattgg gacacagaat cagaacatgg 2260 
cgtgctagaa ggactgagga aaactaggaa 1320 
cataacacag ggaaaggaag aaaatcctac 13 80 
gaggaagcat acctccctgt cacctgactc 144 0 
gtttatcact cagtcagctg cagacattag 1500 
gagcagaact tagaaaccct atttaacttg 1560 
gaggagcagg cgaaacggga caaacgggat: 162 0 
tcatggccct caggcaagca gacttcggag 1680 
aaatgcctaa tagggctggc ttccagtgcg 1740 
caagtagaaa taagccgccc ccttgtccar 1800 
cccaccgccc caggggatga agatactctg 1860 
agcaggactg agggrgcccg gggcgagcgc 192 0 
gggtatgctt. gaccattgag agccaggagg 198 0 

2009 



<210> 2 

<2ll> 1056 

<212> DNA 

<213> Homo sapiens 

<400> 3 

ctagaacgta tnctggagaa ttgggaccaa 
tttatattct tctgcagtac cgcctggcca 
cttcctgagg gaagtataaa ttataacatc 
gagggcaaat ggagtgaagt gccatatgng 
caattatgta aaaagtgtgg tttatgccct 
cccagcgtcc cctccccgac tccttcctca 
gtccaaaagg agatagacaa aggggtaaac 
ttatgccccc tccaagcagt gagaggagga 
tttcctctct cagacttaaa gcaaattaaa 
gacggctata ttgatgtttt acaagggtta 
ataatgttac tactaaatca gacactaacc 
gcccgagagt ttggcgatct ttggtatctc 
gaaagaacaa ctcccacagg ccagcaggca 
gaatcagaac atggagattg gtgccacaaa 
aggaaaacta ggaagaagcc tatgaattac 



tgtgacactc agacgctaag aaagaaacga 60 
caatatcctc ttcaagggag agaaacctgg 12 0 
atcttacagc tagacctctt ctgtagaaag 180 
caaactttct tttcattaag agacaactca 24 0 
acaggaagcc ctcagagtcc acctccctac 3 00 
actaataagg accccccttt aacccaaacg 360 
aatgaaccaa agagtgccaa tattccccga 420 
gaattcggcc cagccagagt gcctgtacct 4 80 
atagacctag gcaaattctc agataaccct 54 0 
ggacaatcct ttgatctgac atggagagat 600 
ccaaatgaga gaagtgccgc tgtaactgca 660 
agtcaggcca acaataggat gacaacagag 720 
gttcccagtg tagaccctca ttgggacaca 780 
catttgctaa cttgcgtgct agaaggactg 84 0 
tcaatgatgt ccactataac acagggaaag 900 
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gaagaaaatc ttactgcttt tctggacaga ctaagggagg cattgaggaa gcatacctcc 960 
ctgtcacctg actctattga aggccaactia atctcaaagg ataagtttat cactcagtca 1020 
gctgcagaca ttagaaaaaa cctcaaaagt ctgcct 10S€ 



< 2 1 C > 4 
<211> 25 
<212> DNA 

- 213 > Homo sapiens 
c400^ 4 

cggacatcca aagtgatggg aaacg 25 



<210> 5 
<211> 26 
<212> DNA 

<213> Homo sapiens 
<400> 5 

ggacaggaaa gtaagactga gaaggc 26 



<210> 6 
<211> 26 
<212> DNA 

<213> Homo sapiens 
<.400> 6 

cctagaacgt attctggaga at.tggg 26 



<210> 7 
<211> 26 
<212> DNA 

<213> Homo sapiens 
<400> 7 

tggctctcaa tggtcaaaca tacccg 26 



<210> 3 
<211> 21 
<212> DNA 

<213> Homo sapiens 
<400> 8 

atgggaaacg ttccccccga g 21 



<210> 9 

<211> 21 

<212> DNA 

<213> Homo sapiens 



<400> 9 

ggcccaaggc agacttttga a 



21 
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<210> 10 

<211> 409 

<212> PRT 

<213> Homo sapiens 

<400> 10 

Met Gly Ser Ser His His His His His His Ser Ser Gly Leu Val Pro 
15 10 15 

Arg Gly Ser His Met Ala Ser Met Thr Gly Gly Gin Gin Met Gly Arg 
20 25 30 

lie Met Gly Asn lie Pro Pro Lys Ala Lys Thr Pro Leu Arg Cys lie 
35 40 45 

Leu Glu Asn Trp Asp Gin Cys Asp Thr Gin Thr Leu Arg Lys Lys Arg 
50 55 60 

Phe lie Phe Phe Cys Ser Thr Ala Trp Pro Gin Tyr Pro Leu Gin Gly 
65 70 75 SO 

Arg Glu Thr Trp Leu Pro Glu Gly Ser lie Asn Tyr Asn He He Leu 
85 90 95 

Gin Leu Asp Leu Phe Cys Arg Lys Glu Gly Lys Trp Ser Glu Val Pro 
100 105 110 

Tyr Val Gin Thr Phe Phe Ser Leu Arg Asp Asn Ser Gin Leu Cys Lys 
115 120 125 

Lys Cys Gly Leu Cys Pro Thr Gly Ser Pro Gin Ser Pro Pro Pro Tyr 
130 135 140 

Pro Ser Val Pro Pro Pro Thr Pro Ser Ser Thr Asn Lys Asp Pro Pro 
145 150 155 160 

Leu Thr Gin Thr Val Gin Lys Glu He Asp Lys Gly Val Asn Asn Glu 
165 170 175 

Pro Lys Ser Ala Asn He Pro Arg Leu Cys Pro Leu Gin Ala Val Arg 
180 185 190 

Gly Gly Glu Phe Gly Pro Ala Arg Val Pro Val Pro Phe Ser Leu Ser 
195 200 205 

Asp Leu Lys Gin He Lys He Asp Leu Gly Lys Phe Ser Asp Asn Pro 
210 215 220 

Asp Gly Tyr He Asp Val Leu Gin Gly Leu Gly Gin Ser Phe Asp Leu 
225 230 235 240 

Thr Trp Arg Asp lie Met Leu Leu Leu Asn Gin Thr Leu Thr Pro Asn 
245 250 255 



Glu Arg Ser Ala Ala Val Thr Ala Ala Arg Glu Phe Gly Asp Leu Trp 
260 265 270 
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Tyr Leu Ser Gin 
275 

Pro Thr Gly Gin 
290 

G^u Ser Glu His 
305 

Leu Glu Gly Leu 



Met Ser Thr lie 
340 

Asp Arg Leu Arg 
355 

Ser lie Glu Gly 
370 

Ala Ala Asp He 
385 

Ala Leu Glu His 



Val Asn Asn Arg 
280 

Gin Ala Val Pro 
295 

Gly Asp Trp Cys 
310 

Arg Lys Tr.r Arg 
325 

Thr Gin Gly Lys 



Glu Ala Leu Arg 
360 

Gin Leu He Leu 
375 

Arg Lys Asn Phe 
390 

His His His His 
405 



Met Thr Thr Glu 



Ser Val Asp Pro 
300 

His Lys Hi s Leu 

3 15 

Lys Lys Pro Met 

330 

Glu Glu Asn Pro 
345 

Lys His Thr Ser 



Lys Asp Lys Phe 
380 

Lys Ser Leu Pro 
395 

His 



Glu Arg Thr Thr 
285 

His Trp Asp Thr 



Leu Thr Cys Val 
320 

Asn Tyr Ser Met 
335 

Thr Ala Phe Leu 
350 

Leu Ser Pro Asp 
365 

He Thr Gin Ser 



Lys Leu Ala Ala 
400 



<210> 31 
<2I1> 393 
<212> PRT 

<213> Homo sapiens 



<400> 11 

Met Ala Ser Met 

l 

He Pro Pro Lys 
20 

Leu Glu Asn Trp 
35 

Phe lie Phe Phe 
50 

Arg Glu Thr Trp 
65 

Gin Leu Asp Leu 



Tyr Val Gin Thr 
100 

Lys Cys Gly Leu 
115 



Thr Gly Gly Gin 
5 

Ala Lys Thr Pro 



Asp Gin Cys Asp 
40 

Cys Ser Thr Ala 

55 

Leu Pro Glu Gly 
70 

Phe Cys Arg Lys 
85 

Phe Phe Ser Leu 



Cys Pro Thr Gly 
120 



Gin Met Gly Arg 
10 

Leu Arg Cys He 
25 

Thr Gin Thr Leu 



Trp Pro Gin Tyr 
60 

Ser He Asn Tyr 
75 

Glu Gly Lys Trp 
90 

Arg Asp Asn Ser 
105 

Ser Pro Gin Ser 



He Met Gly Asn 
15 

Leu Glu Arg lie 
30 

Arg Lys Lys Arg 
45 

Pro Leu Gin Gly 



Asn He He Leu 
80 

Ser Glu Val Pro 
95 

Gin Leu Cys Lys 
110 

Pro Pro Pro Tyr 
:25 
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Pro Ser Val Pro Ser Pro Thr Pro Ser Ser Thr Asn Lys Asd Pro Pro 
130 135 140 

Leu Thr Gin Thr Val Gin Lys Glu lie Asp Lvs Gly Val Asn Asn Glu 
"5 150 155 160 

Pro Lys Ser Ala Asn lie Pro Arg Leu Cys Pro Leu Gin Ala Val Arg 
165 170 175 

Gly Gly Glu Phe Gly Pro Ala Arg Val Pro Val Pro Phe Ser Leu Ser 
180 iss 190 

Asp Leu Lys Gin lie Lys He Asp Leu Gly Lys Phe Ser Asp Asn Pro 
135 200 205 

Asp Gly Tyr lie Asp Val Leu Gin Gly Leu Gly Gin Ser Phe Asd Leu 
210 215 220 

Thr Trp Arg Asp lie Met Leu Leu Leu Asn Gin Thr Leu Thr Pro Asn 
22 5 230 235 240 

Glu Arg Ser Ala Ala Val Thr Ala Ala Arg Glu Phe Gly Asp Leu Trp 
245 250 " 255 

Tyr Leu Ser Gin Ala Asn Asn Arg Met Thr Thr Glu Glu Arg Thr Thr 
260 265 270 

Pro Thr Gly Gin Gin Ala Val Pro Ser Val Asp Pro His Trp Asp Thr 
275 280 285 

Glu Ser Glu His Gly Asp Trp Cys His Lys Kjls Leu Leu Thr Cys Val 
290 295 300 

Leu Glu Gly Leu Arg Lys Thr Arg Lys Lys Pro Met Asn Tyr Ser Met 
305 310 315 320 

Met Ser Thr lie Thr Gin Gly Lys Glu Glu Asn Leu Thr Ala Phe Leu 
325 330 335 

Asp Arg Leu Arg Glu Ala Leu Arg Lys His Thr Ser Leu Ser Pro Asp 
340 345 350 

Ser He Glu Gly Gin Leu He Leu Lys Asp Lys Phe lie Thr Gin Ser 
355 360 365 

Ala Ala Asp He Arg Lys Asn Phe Lys Ser Leu Pro Lys Leu Ala Ala 
370 375 380 

Ala Leu Glu His His His His His His 
385 390 



<210> 12 
<211> 26 
<212> DNA 

<213> Homo sapiens 
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<400> 12 

ctagaacgta ttctggagaa ttggga 

< 2 1 0 ,> 13 

<211> 20 

<212> DNA 

<2]3> Homo sapiens 

< 4 0 0 > 13 

ectaaggcag act tzrgaag 



<210> 14 

<211> 54 
<212> DNA 

<2 13> Homo sapiens 

<400> 14 

tttggtaata cgactcacta tagggcagcc accatgggaa acgztccccc cgag 

<210> 15 

<211> 44 

<212> DNA 

<213> Homo sapiens 

<400> 15 

tttttttttt tttttttttc aggctrgcgcc agtgtccagg agac 

<210> 16 

<2il> 20 

<212> DNA 

<213> Homo sapiens 

<40Q> 16 

tgtccgctgt gctcctgatc 

<210> 17 

<211> 44 

<212> DNA 

<213> Homo sapiens 

<400> 17 

tttttttttt tttttttttc aggctgcgcc agtgtccagg agac 

<210> 18 

<211> €78 

<212> DNA 

c2i3> Homo sapiens 

<400> 18 

tcagggatag cccccatcta tttggccagg cattagccca agacttgagc cagttctcat 
acctggatat tcttgtcctt tggtatgcgg atgatttact tttagccgcc cgttcagaaa 
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ccttgtgcca tcaagccacc caagtgcrct 

tttccaaacc aaaggctcag ctctgctcac 

ggctgaaatt atccaaaggc accagggccc 

cttatcctta tcccaaaacc ctaaaacaac 

caggcataac aggtttctgc tgaatatgga 

tanatacact aattaaggaa actcagaaag 

aagcagaggc agctttccag gccgtaaaga 

tqccagcggg gcaagactct tctttctgtg 

tccttacaca ggtccgaggg accagcttgc 
atgtagtggc aaagggtt 



taaatttccn cgccacctgt ggctacaagg 180 
agcagaaggc tatttaccct aaatactxag 24 0 
tcagtgagga atgtatccag cctatactgg 3CQ 
taagaaggtt ccttggcata ataggcataa 360 
ttcccaagta cggcaaaata gccagaccai. 420 
ccaataccca rctagtaaga tggacacctg 480 
acaccctaac ccaagcccca gtgttaagct 540 
Lcacagaaaa aataggaata gctntaggag 600 
aacccatggc atacctgagt aaggaaattg 660 

678 



<210> 19 

<211> 591 

<212> DNA 

<213> Homo sapiens 

<400> 19 

ccatggccat ctacactgaa caagatttac 

aaagagtacc cactcttcct tttgttatca 

gcattggcag tatcacaacc tctactcagt 

gtgacatgga acaggtcact gactccctgg 

cagcagtagt ccttcaaaat cgaagagctc 

cctgtttatt tttaggagaa gaacgccgtt 

agaaagttaa agaaattcga gatcgaatac 

aacgctgggg cctcctcagc caatggatgc 

cagctctaat attgttactc ctctttggac 

tctcttccag aatrga&gct gtaaagctac 



acaatcatgt cgtacctaag ccccacaaca 60 
gagcaggagt gctaggcaga ctaggtactg 120 
tctactacaa actatctcaa gaaataaatg 180 
tcaccttgca agatcaactt aactccctag 240 
tagacttgct aaccgccaaa agagggggaa 300 
attacgtcaa tcaatccaga attgtcactg 360 
aatgtagagc agaggagctt caaaacaccg 420 
cctgggtxct ccccttctta ggacctctag 480 
cctgtatctt taacctcctt gttaagtttg 540 
agatggtctt acaaatctag a S9i 



<2l0> 20 
<211> 1321 
<212> DNA 

<213> Homo sapiens 
<400> 20 

caacaatcgg gatataaacc caggcattcg 
cttccctttg tatgggagct gttttcatgc 
ctcttctggt ccatgtttct tacggctcga 
tgtttgccac caccgcagac ctgccgctga 
cgctgtgctc ctgatccagc gaagcgccca 
cattgttcct gcacggctaa gtgcctgggt 
gggttccatg gttctcttct gtgacccacg 
tggcccaaga ttccattcct tggaatccgt 
aagcttgcca ccatcttgga agcggcctgc 
cgggagctct gtgagcaagg accccccggt 
aagcgatggg aaacgttccc cgcaagacaa 
gggaacaatt: tgaccctcag acactaagaa 
cctggcactc ctgagggaag tataaattat 
agaaaaggca aatggagtga agtgccataa 
tcacaattat gtaaaaagtg tgatttatgc 
tatcccagca tccccgactc cttccccact 
ccaaaaggag atagacaaaa gggtaaacag 
atgacccctc caagcagtgg gaggaagaga 
ttctctccca gacttaaagc aaataaaaac 
tggccatatt ggtgttttac aagggttagg 
atatgtcact gct.aaat.cag acactaaccc 
cctgagagtt tggcgatctc tggtatctca 



agctggcaac agcagccccc ctttgggtcc 60 
tatttcactc tactaaatct tgcaactgca 120 
gctgagcttt tgctcaccgt ccaccactgc 180 
ctcccatccc tccggatcct gcagggtgtc 24 0 
ctgccgctcc caattgggct aaaggcttgc 300 
ttgttctaat tgagctgaac actagtcact 360 
gcttctaata gaactataac acttaccaca 420 
gaggccaaga actccaggtc agagaatacg 4 80 
taccatcttg gaagtggttc accaccatct 540 
aacattttgg caaccacgaa cggacatcca 600 
aaacgcccct aagacgtatt ctggaaaatt 660 
agaaacgact tacattcttc tgcagtgccg 720 
aacaccatct tacagctaga cctcttttgt 7 80 
gtacaaactt tcttttcatt aagagacaac 840 
cctacaggaa gccttcagag tctacctccc 900 
taataaggac cccccttcaa cccaaatggt 960 
tgaaccaaag agtgccaata ttccccaatt 1020 
attcggccca gccagagtgc atgtgccttt 1080 
agacttaggt aaattctcag ataaccctga 1140 
acaattcttt gatctgacat ggagagatat 12 0 0 
caaatgagag aagtgccacc ataactgcag 1260 
gtcaggtcaa tgataggatg acaacagagg 132 0 
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■c210> 21 
<211> 2938 
<212> DNA 

<213> Homo sapiens 
<400> 21 

caacgacgga catccaaagt gatgggaaac gtr.ccccgca agacaaaaac qcccctaaga 60 
cgcattctgg agaattggga ccaacttgac cctcagacac taagaaagaa acgacttata 120 
ttcttccgca gtgccgcctg gcactcctga gggaagtata aattataaca ccatcttaca 180 
gctagacttc ttttgtagaa aaggcaaatg gagtgaagtg ccacaagtac aaactttctt 240 
ttcattaaga gacaactcac aattatgtaa aaagtgtgat ctatgcccta caggaagcct 300 
tcagagtcta cctccctatc ccagcatccc cgactccttc cccaactaat aaggaccccc 360 
cttcaaccca aatggtccaa aaggagatag acaaaagggt aaacagtgaa ccaaagagtg 420 
ccaatattcc ccaattatga cccctcccaa gcagtgggag gaagagattc ggcccagcca 4 80 
gagcgcatgt gctttittctt czcccagact taaagcaaat aaaaacagac ttaggtaaat 540 
tctcagataa tcctgatqgc tatattgatg ttttacaagg gttaggacaa ttctttgatc 600 
tgacatggag agatataatg tcactgctaa atcagacact aaccccaaat gagagaagtg 660 
ccaccataac tgcagcctga gagtttggcg atccctggta tctcagtcag gtcaatgata 720 
ggatgacaac agaggaaaga gatgatcccc acagccagca agcagttccc agtctasacc 780 
ctcattgggg acacagaaat cagtaacatg ggagatcggt gctgcagaca tttgctaact 840 
tgtgtgctac aaggactaag gaaaactacg aagaaaatct acgaattact caatgatgtc 900 
caccataaca caggggaagg gaagaaaatc ctactgcctt tctggagaga ctaagggagg 960 
cattgaggaa gcgtgcctct ctgtcacctg actcttctga aggccaacta atcttaaagc 1020 
gtaagtttat cactcagtca gctgcagaca ttagaaaaaa cttcaaaagt ctgccgtagg 1080 
cccggagcaa aacttagaaa ccctattgaa cttggcaacy tcggtttttt ataatagaga 1140 
tcaggaggag caggcggaac aggacaaacg ggattaaaaa aaaggccacc gctttagtca 1200 
tgaccctcag gcaagtggac tttggaggct ctggaaaagg gaaaagctgg gcaaattgaa 1260 
tgcctaatag ggcttgcttc cagtgcggtc tacaaggaca ctttaaaaaa gattgtccaa 1320 
gtagaagtaa gccgcccctt cgcccatgcc ccttatttca agggaatcac tggaaggccc 1380 
actgccccag gggacaaagg tcttttgagt cagaagccac taaccagatg atccagcagc 1440 
aggactgagg gtgcctgggg caagcgccat cccatgccat caccctcaca gagccctggg 1500 
tatgcttgac cattgagggc caggaaggtt gtctcctgga cactggtgcg gtcttcttag 1560 
tcttactctt ctgtcccgga caactgtcct ccagatctgt cactatctga gggggtccta 1620 
agacgggcag tcactagata cttctcccag ccactaagtt atgactgggg agctttattc 1680 
ttttcacatg cttttctaat tatgcttgaa agccccacta ccttgttagg gagagacatt 174 0 
ctagcaaaag caggggccat tatacacctg aacataggag aaggaacacc cgtttgttgt 1800 
cccctgcttg aggaaggaat taatcctgaa gtctgggcaa cagaaggaca atatggacga 1860 
gcaaagaatg cccgtcctgt tcaagttaaa ctaaaggatt ccacttcctt tccctaccaa 1920 
aggcagcacc ccctcagacc caaggcccaa caaggattcc aaaagattgt taaggactta 1980 
aaagcccaag gcttagtaaa accatgcata actccctgca gtaattccgt agtggattga 204 0 
ggaggcacag aaacccagtg gacagtggag ggttagtgca agatctcagg attatcaatg 2100 
gaggccgttg tccttttata cccagctgta cctagccctt atactgtgct ttcccaaata 2160 
ccagaggaag cagagtggtt tacactcctg gaccttaagg atgccttctt ccgcatccct 2220 
gcacatcctg actctcaatt cttgtttgcc ttngaagata cttcaaaccc aacatctcaa 2280 
ctcacctgga ctgttttacc ccaagggttc agggatagcc cccatctatt tggccaggca 2340 
ttagcccaag acttgagcca atcctratac ctggacactt gtccttcggt aggtggatga 2400 
tttacttttg gccgcccatt cagaaacctt gtgccatcaa gccacccaag cgctcttcaa 2460 
tttcctcgct acctgtggct acatggtttc caaaccaaag gctcaactct gctcacagca 2520 
ggttacttag ggctaaaatt atccaaaggc accagggccc tcagtgagga acacatccag 2580 
cctatactgg cttatcctca tcccaaaacc ctaaagcaac taaggggatt ccttggcgta 2640 
ataggtttct gccgaaaatg gattcccagg tttggcgaaa tagccaggtc attaaataca 2700 
ctaattaagg aaactcagaa agccaatacc catttagtaa gatggacaac tgaagtagaa 2760 
gtggctttcc aggccctaac ccaagcccca gtgttaagtt tgccaacagg gcaagacttt 2820 
tcttcatatg tcacagaaaa aacaggaata gctctaggag tccttacaca gatccgaggg 2880 
atgagcttgc aacctgtggc gtacctgact aaggaaattg acgtagtggc aaagggtt 2938 
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<2I0> 22 
<211> 1422 
<222> DNA 

<213> Homo sapiens 
<400> 22 

tcagggatag cccccatcta ttitggccagg 
acctggacac tcttgncctt cagtatgtgg 
ccttgtgcca tcaagccacc caagcactct 
ttcccaaaga gaagctcagc tctgctcaca 
caaaggcacc aaggccctca gtgaggaatg 
caaaacccta aagcaactaa gagagttcct 
tccccaggta tggcaaaata gccaggccat 
ccaataccca tttaataaga tggatacctg 
aaggccttaa acccaagtcc cagtgttaag 
catcacagaa aaaaacagaa acagctctgg 
tgcaacccat ggcatacctg agtaaggaaa 
gtctacgggt agtggtggca gtagcagttg 
ggagagatct tactgtgtgg acatctcatg 
acttgtggct gtcagacaac cgtttactta 
tgctgcaact gtgcacttgt gcaactctta 
atagaataca actgtcaaca aataatttct 
gaagttccct tgactgatcc tgaccttcaa 
aaaaggactt caaaagcggg gtatgcagtg 
ccctcactcc aggaactagt gcttagctgg 
aattaggaga aggaaaaagg gtaaatatat 
tccatgccca tgaggcaata tgcagagaaa 
tcacacatca ggaagccatt aggagattat 
aagtcttaca ctgctggggt. catcagaaag 
aagcagacat tgaagcaaaa agagctgcaa 



catcagccca agacrtgagt cagtzatcat 60 
atgatttact tttagctgcc -gttcagaaa 120 
taaatttcct cgccacctgn ggctacaagg 180 
gcaggttaaa tacttaggac taagattatc 24 0 
tatccagcct atactggctt atcctcatct 300 
tggcataaca ggcttctccc gaatatggat 360 
uatatacagt aattaaggaa actcagaaag 420 
aagccaaagt ggctttccag gcccctaaag 480 
cctgccaacg gggcaagact tttctttata 540 
gagtccttac acaggtccaa gggacgagct 600 
ctgatgtagt ggcaaagggt tggcttcatt 660 
tagtatccga agcagttaaa ataatacagg 720 
aggtgaacag catactcact gctaaaggag 78 0 
aatatcaggc tctattactt gaaaggccag 840 
acccagtcnc atttcttcca gacaatgaag 900 
caaacctatg ccacccgagg ggaccttcta 960 
cttgtacact gatggaagtt cctttgtaga 1020 
gtcagtgata atggaatatt tgaaagtatc 1080 
cagaactaat agccttcatt ggggcactag 1140 
atacagactc tgagtatgct cacctagtcn 1200 
gggaattcct aacttccgag ggaacaccta 1260 
tactggcagt acagaaacct aaagaggtgg 1320 
gaaagaaaag ggaaacagaa gggaatitgcc 13 80 
ggcaggaccc tc 1422 



<210:> 23 
<211> 2006 
<212> DKA 

<213> Homo sapiens 
<400> 23 

atgcagtggt cagtgataat ggaatacttg 
ctcagctagc agaactaata gccctcactt 
caaatatata tacagactct aaatatgctc 
ggaaagaaag ggaattccta acttctgaga 
ggaaattatt attggctgta cagaaaccta 
atcanaaagg aaaggaaagg gaaaatactt 
aaaacccttc atcaaacctt tcacttaggc 
ttatttactg gaccaggcct tttcaaaact 
tgccaaaaaa ataatcccct gcctcatcgc 
cattaccctg aaaaaaactg gcaactgatt 
agratctact agtctgggta aatactttca 
cagaaaaggc ccaagaggta acaaaggcac 
ttccccgagg cttacagagt gacaatagcc 
tatcccaggc gttaggtata cgatatcact 
ggaaggtcga gaaaatgaat gaaatactca 
cccacctcac atggcctgcc ctgttgccta 
aaaaagcagg acttagccca tacgaaatgc 
ttgtgcttga cccaagacag ccaacttagt 



aaagtaatcc cctcactcca ggaactagtg 60 
gggcactaga attaggagaa gaaaaaaggg 12 0 
acctagtcct ccatgcccat gcagcaatat 180 
gaacacctat caaacatcag gaagccatta 240 
aagaggtggc agtcttacac tgccggggtc 3 00 
tcgcctgcaa ctatccaatg gaaattactt 360 
atcgatagca cccatcaaat ggccaaatca 420 
accaagcaaa tattcagggc ctgtgaattg 480 
caagctcctt caggaaaaca aaaaacaggc 54 0 
ctacccacaa gcccaaacct cagggattcc 600 
cgggttgggc aaaggccttc ccctgtagga 650 
tagctcatga aataattccc agattcggac 720 
ctgctttcca ggccacagta acccagggag 780 
tacactgcgc ctgaaggcca cagtcctcag 840 
aaggacatct aaaaaagcaa acccaggaaa 900 
tagccttaaa aagaatctgc aactttcccc 960 
tgtatggaag gcccttcata accaatgacc 1020 
tgcagacatc acctccttag ccaaatacca 1080 
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acaagttctt aaaacattac aaggaaccta 
cccttgtgac atggtattag tcaagtccct 
ctgggaagga ccctacccag tcattttatt 
ggtcttggat acaccacact tgagtcaaat 
caggagacaa cgctagctat tcctgtgaac 
aacaaccagg aggaaagtaa ctaaaatcat 
tttctcttta ctgtt.ctttt accctctctc 
atgaccagta gctcccctta ccaagagttt 
gatgccccat cgtataggag tctttctaag 
atgccccgca acigctatca ctctgccact 
acaggaaaaa tgattaatcc tagttgtcct 
tacttcaccc aaactggtat gtctgatggg 
catgtaaaag aagtaatctc ccaactcacc 
gacnagatct ctcaaaacta catgaaaccc 
ttaataccac cctcaccggg ctccatgagg 
tatgcctccc cctgaacttc aagcca 



tccctgagaa gagggaaaag aactattcca 114 0 
tctctctaat tccccatccc tagatacatc 1200 
taccccaact gcggttaaag tggctggagt 1260 
cctggatact gccaaaggaa cctgaaaatc 1320 
ctctagagga tttgcgcctg ctcttcaaac 1380 
aaatccccca tggccctccc ttatcatatt 1440 
actcccactg caccccctcc atgccgctgt 1500 
ctatggagaa tgcagcgtcc cggaaatatt 1560 
ggaaccccca ccttcactgc ccacacccat 1620 
ctctgcatgc atgcaaatac tcattattgg 16BO 
ggaggacttu gagccactgt ctgttggact 174 0 
ggtggagctc aagatcaggc aagagaaaaa 1800 
cgggtacatg gcacctctag ccctacaaag I860 
tccgtaccca tactcgcctg gtaagcctat 1920 
tctcggccca aaaccctact aactgttgga 1980 

2006 



<210> 24 
<211> 1948 
<212> DMA 

<213> Homo sapiens 
<400> 24 

actgcactct tctggtccat gtttcttacg gctcgagccg agcttttgct caccgtccac 60 
cactgctgtt tgccaccacc gcanacctgc cgctgactcc catccctctg gatcctgcag 120 
ggtgtccgct gtgctcctga tccagcgagg cgcccattgc cgcccccaat tgggctaaag 180 
gcttgccatt gtncctgcac ggctaagtgc ctgggtttgt tctaattgag ctgaacacta 24 0 
ntcactgggt tccatggttc tcttctgtga cccacggctt ctaa:agaac tataacactt 300 
accacatggc ccaagattcc attccttgga atccgtgagg gcaagaactc caggtcagag 360 
aatacgaggc ttgccaccat cttggaagcg gcctgctacc atcttggaag tggttcacca 420 
ccatcttggg agctctgtga gcaaggaccc cccggtaaca ttttggcaac cacgaacgga 4 80 
catccaaagt gatacatcct gggaaggacc ctacccagtc attttatcta ccccaactgc 54 0 
ggttaaagtg gctggagtgg agtcttggat acatcacact tgagtcaaat cctggatact 600 
gccaaaggaa cctgaaaatc caggagacaa cgctagctat tcctgtgaac ctctagagga 660 
tttgcgcctg ctcttcaaac aacaaccagg aggaaagtaa ctaaaatcat aaatccccat 72 0 
ggccctccct tatcatattt ttctctttac tgttgtttca ccctctttca ctctcactgc 780 
accccctcca tgccgctgta tgaccagtag ctccccttac caagagtttc tatggagaat 84 0 
gcagcgtccc ggaaatattg atgccccatc gtataggagt ctttgtaagg gaacccccac 900 
cttcactgcc cacacccata tgccccgcaa ctgctatcac tctgccactc tttgcatgca 960 
tgcaaatact cattattgga caggaaaaat gattaatcct agttgtcctg gaggacttgg 1020 
agtcactgtc tgttggactt acttcaccca aactggtatg tctgatgggg gtggagttca 1080 
agatcaggca agagaaaaac atgtaaaaga agtaatctcc caactcaccc gggtacatgg 114 0 
cacctctagc ccctacaaag gactagatct ctcaaaacta catgaaaccc tccgtaccca 1200 
tactcgcctg gtaagcctat ttaataccac cctcactggg ctccatgagg tctcggccca 1260 
aaaccctact aactgttgga tatgcctccc cctgaacttc aggccatatg tttcaatccc 1320 - 
tgtacctgaa caatggaaca acttcagcac agaaataaac accacttccg ttttagtagg 1380 
acctcttgtt tccaatctgg aaataaccca tacctcaaac ctcacctgtg taaaatttag 144 0 
caatactaca tacacaacca actcccaatg catcaggtgg gtaactcctc ccacacaaat 1500 
agtctgccta ccctcaggaa tattttttgt ctgtggtacc tcagcctatc gttgtttgaa 1560 
tggctcttca gaatctatgt gcttcctctc attcttagtg ccccctatgg ccatctacac 1620 
tgaacaagat ttatacagtt atgtcatatc taagccccgc aacaaaagag tacccattct 1680 
tccttttgtt ataggagcag gagtgctagg tgcactaggt actggcattg gcggtatcac 174 0 
aacctctact cagttctact acaaactatc tcaagaacta aatggggaca tggaacgggt 1800 
cgccgactcc ctggtcacct tgcaagatca acttaactcc ctagcagcag tagtccttca 1860 
aaatcgaaga gctntagact tgctaaccgc tgaaagaggg ggaacctgtt tatttttagg 1920 
ggaagaatgc tgttattatg ttaatcaa 1948 
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<210> 25 
<211> 1136 
<212> DNA 

<213> Homo sapiens 
<400> 25 

ccacggccat ctacactgaa caagacttat 
aaagagtacc cattcttcct tttgttatag 
gcatztggcgg tatcacaacc tctactcagt 
gggacatgga acgggtcccc gactccctgg 
cagcagtagt ccttcaaaac cgaagagctt 
cctgtttatt tttaggggaa gaatgctgtt 
agaaagttaa agaaattcga gatcgaatac 
gaccctgggg cctcctcagc caatggatgc 
cagctataat attgctactc ctctttggac 
tctcttccag aatcgaagct gtaaaactac 
agatctaccg cagacccctg gaccggcctg 
aaggcacccc tcctgaggaa arctcagctg 
gaagcagtta gagcggtcgt cggccaacct 
atgggggact gagagacagg actagctgga 
tagctgggaa ggtgaccaca tccaccttta 
gaccaatcag agagctcact aaaatgctaa 
caatcatcta ttgcatgaga gcacagcagg 
cttcgagccg gcaacggcaa ccccctttgg 
tcatgctatt tcactctatt aaatcttgca 



acagttatgt catatctaag ccccgcaaca 60 
gagcaggagt gctaggtgca ctaggcactg 120 
tctactacaa actatctcaa gaactaaatg 180 
tcaccttgca agatcaactt aactccctag 240 
tagactcgct aaccgctgaa agagggggaa 3 00 
attacgttaa tcaatccgga atcgtcactg 360 
aacgtagagc agaagagctt cgaaacactg 420 
cctggattct ccccttctta ggacctctag 48c 
cctgtatctt taacctcctt gttaactttg 540 
aaacggagcc caagangcag tccaagacta 600 
ctagcccacg atctgangtt aatgacatca 660 
cacaacctct actacgcccc aattcagcag 720 
ccccaacagc acttaggttt tcctgttgag 780 
tttcctaggc Cgactaagaa tccctaagcc 84 0 
aacacggggc ttgcaactta gttcacacct 900 
ttaggcaaag acaggaggta aagaaatagc 960 
agggacaatrg atcgggatat aaacccaagt 102 0 
gtcccctccc tttgtatggg agctctgttt 1080 
gctgcgaaaa aaaaaaaaaa aaaaaa 113 6 



<210> 26 

<211> 2782 

<212> DNA 

<213> Homo sapiens 

<4G0> 26 

at 999 a 9Ctg tcttcatgct atctcactct 
catgtttctt acggctcgag ctgagctttt 
accgcagacc tgccgctgac tcccacccct 
tgatccagcg aagcgcccat tgccgctccc 
cacggctaag tgcctgggtt tgttctaatt 
ttctcttctg tgacccacgg cttctaatag 
tccattcctt ggaatccgtg aggccaacga 
ccatcttgga agcggcctgc taccatcttg 
gtgagcaagg accccccggt gacattttgg 
tcctgggaag gaccctaccc agtcatttta 
gtggagtctt ggatacatca cacttgagtc 
aatccaggag acaacgctag ctattcctgt 
aaacaacaac caggaggaaa gtaactaaaa 
acttttctct gtagtgttct ttcaccctgt 
tgtatgacca gtagctcccc tcacccagag 
attgatgccc catcgtatag gagtctttct 
catatgcccc gcaactgcta tcactctgcc 
tggacaggaa aaatgattaa tcctagttgt 
acttacttca cccaaactrgg tatgtctgat 
aaacatgtaa aagaagtaat ctcccaactc 
aaaggactag atctctcaaa actacatgaa 
ctatttaata ccaccctcac tgggctccat 
tggatatgcc tccccctgaa cttcaggcca 
aacaacttca gcacagaaat aaacaccact 



attaaacctt gcaacrgcac tcttctggtc 60 
gctcaccgtc caccactgct gtttgccacc 120 
ctggatcctg cagggtgtcc gctgfcgctcc 180 
aattgggcta aaggcttgcc attgttcctg 24 0 
gagctgaaca ctagtcactg ggttccatgg 300 
aactataaca cttaccacat ggcccaagat 3 60 
actccaggtc agagaatacg aagcttgcca 420 
gaagtggttc accaccatct tgggagctct 480 
cgaccaccaa cggacatccc aagtgataca 54 0 
tctaccccaa ctgcggttaa agtggctgga 600 
aaatcctgga tactgccaaa ggaacctgaa 660 
gaacctctag aggatttgcg cctgctcttc 720 
tcataaatcc ccatgggcct cccttatcat 780 
ttcactctca ctgcaccccc tccatgccgc 84 0 
tttctatgga gaatgcagcg tcccggaaat 900 
aagggaaccc ccaccttcac tgcccacacc 96 0 
actctttgca tgcatgcaaa tactcattat 1020 
cctggaggac ttggagtcac tgtctgttgg 1080 
gggggtggag ttcaagatca ggcaagagaa 114 0 
accggggtac atggcacctc tagcccctac 1200 
accctccgta cccatactcg cctggtaagc 1260 
gaggtctcgg cccaaaaccc tactaactgt 1320 
tatgtttcaa tccctgtacc tgaacaatgg 1380 
tccgttttag taggacctct tgtttccaat 1440 
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gtggaaataa cccatacctc aaacctcacc 
accaaccccc aatgcatcag gtgggtaact 
ggaatatttt ttgtctgtgg taccccagcc 
atgtgcttcc tctcattctt agtgcccccr 
agttatgtca tatctaagcc ccgcaacaaa 
gcaggagtgc taggtgcact aggtactggc 
tactacaaac tatctcaaga actaaatggg 
accttgcaag atcaacttaa ctccctagca 
gacttgctaa ccgctgagag agggggaacc 
tatgttaatc aatccggaat cgtcactgag 
cgtatagcag aggagcttcg aaacaccgga 
tggattctcc ccttcttagg acctctagca 
tgtatctttg acctccttgt taactttgtc 
atggagccca agatgcagtc caagactaag 
agcccacgat ctgatgttaa tgacatcaaa 
caacctctac tacgccccaa ttcagcagga 
ccaacagcac ttaggttttc ctgttgagat 
tcctaggctg actaagaatc cttaagccta 
cacggggctt gcaactragc tcacacctga 
aggcaaagac aggaggtaaa gaaatagcca 
ggacaatgat cgggatataa acccaagttt 
cccctccctt tgtatgggag ctctgttctc 
tgcaaaaaaa aaaaaaaaaa aa 



tgtgtaaaat ttagcaatac tacatacaca 1500 
cctcccacac aaacagtctg cctaccctca 1560 
tatcgctgtt tgaatggctc ttcagaatcc 1620 
atgaccatct acactgaaca agatttatac 1680 
agagtaccca ttcttccttt tgttatagga 1740 
attggcggta tcacaacctc tactcagttc 1800 
gacatggaac gggtcgccga ctccctggtc 1860 
gcagnagtcc ttcgaaatcg aagagcctta 1920 
tgtttatttt taggggaaga atgctgttat 1980 
aaagttgaag aaattccaga tcgaatacaa 2040 
ccctggggcc tccccagccg acggatgccc 2100 
gctataacac ngccactcct ctttggaccc 2160 
tcttccagaa tcgaagctgt gaaactacaa 2220 
atctaccgca gacccctgga ccggcctgct 2280 
ggcacccctc ctgaggaaat ctcagctgca 2340 
agcagttaga gcggtggtcg gccaacctcc 24 00 
gggggactga gagacaggac tagctggatt 2460 
ggtgggaagg tgaccacatc cacctttaaa 2520 
ccaatcagag agctcactaa aatgctaatt 2580 
atcatttatt gcccgagagc acagcaggag 264 0 
tcgagccggc aacggcaacc ccctttgggt 2700 
atgctatttc actctattaa atcttgcaac 2760 

2782 



<210> 27 

<211> 666 

<212> DNA 

<213> Homo sapiens 

<400> 27 

tgtccgctgt gctcctgauc cagcgaggcg cccatzgccg ctcccaattg ggctaaaggc 60 

ttgccattgt tcctgcacgg ctaagtgcct gggtttgttc taattgagct gaacactant 120 

cactgggttc catggttctc ttctgtgacc cacggcttct aatataacca taacacttac 180 

cacatggccc aagattccat tccttggaat ccgtgaggcc aagaactcca ggtcagagaa 24 0 

tacgaggctt gccaccatct tggaagcggc ctgctaccat cttggaagtg gttcaccacc 300 

atcttgggag ctctgtgagc aaggaccccc cggtaacatt ttggcaacca cgaacggaca 3 60 

tccaaagtga atcgaagctg taaaactaca aatggagccc aagatgcagt ccaagactaa 42 0 

gatctaccgc agacccctgg accggcctgc tagcccacga tctgatgtta atgacatcaa 480 

aggcacccct cccgaggaaa tctcagctgc acaacctcta ctacgcccca attcagcagg 540 

aagcagttag agcggtcgtc ggccaacctc cccaacagca cttaggtttt cctgttgaga 600 

tgggggactg agagacagga ctagctggat ttcctaggct gactaagaat ccctaagcct 660 

agctgg 666 



<210> 28 

<211> 3372 

<212> DNA 

<213> Homo sapiens 

<400> 28 

gacttcccaa ataccagagg aagcagagtg 
cttctgcacc cctgtacatc ctgactctca 
cccagcatct caactcacct ggactatttt 
atttggccag gcattagccc aagacttgag 
ggtaggtgga tgatttactt ttggccgccc 
aagcgctctt caatttcctc gctacctgtg 
tctgctcaca gcaggttact tagggctaaa 



gtttacagtc ctggaccttc aggatgcctt 60 
attcttgttt gcctttgaag atacttcaaa 120 
accccaaggg ttcagggafca gtccccatct 180 
ccaatcctca tacctggaca cttgtccttc 24 0 
attcagaaac cttgtgccat caagccaccc 3 00 
gctacatggt ttccaaacca aaggctcaac 360 
attatccaaa ggcaccaggg ccctcagtga 420 
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ggaacacatc cagcctatac tggcttatcc tcatcccaaa accctaaagc aactaagggg 4 80 
attccttggc gtaataggtt tctgccgaaa atggattccc aggtatggcg aaatagccag 540 
gtcattaaat: acactaatta aggaaactca gaaagccaat acccatttag taagatggac 600 
aactgaagta gaagtggctt tccaggccct aacccaagcc ccagtgttaa gtttgccaac 660 
agggcaagac ttttgttcat atgtcacaga aaaaacagga atagctctag gagtccttac 720 
acagatccga gggatgagct tgcaacctgt ggcacacctg actaaggaaa ttgatgtagt 780 
ggcaaagggt tgacctcatt gtttacgggt agtggtggca gtagcagtct tagtatctga 84 0 
agcagttaaa ataatacagg gaagagatct cactgtgtgg acatctcatg atgtgaatgg 900 
catactcact gctaaaggag acttgtggct gtcagacaac tgtttactta aatgtcaggc 960 
tctattactt gaagggccag tgctgcgact gtgcacctgt gcaactctta acccagccac 102C 
atttct tcca gacaatgaag aaaagataaa acacaactgt caacaagtaa tttctcaaac 1080 
ctatgccact cgaggggacc ttttagaggt tcccttgact gaccccgacc tcaacttgta 1140 
tactgatgga agttcctttg tagaaaaagg acttcgaaaa gtggggtatg cagtggtcag 1200 
tgataatgga atactcgaaa gtaatcccct cactccagga actagtgctc agctagcaga 1260 
actaatagcc ctcacttggg cactagaatt aggagaagaa aaaagggcaa atataataca 132 0 
gactctaaat atgcttacct agtcctccat gcccatgcag caatatggaa agaaagggaa 13 8 0 
ttcctaactt ctgagagaac acctatcaaa catcaggaag ccattaggaa attattactg 1440 
gctgtacaga aacctagaga ggtggcagtc ttacactgcc ggggtcatca caaaggaaag 15 0 0 
gaaagggaaa tacaagagaa ctgccaagca ratattgaag ccaaaagagc tgcaaggcag 1560 
gaccctccat tagaaatgct tattaaactt ccctcagtat agggtaatcc cttccgggaa 1620 
accaagcccc agtactcagc aggagaaaca gaatggggaa cctcacgagg cagttttctc 1680 
ccctcgggac ggttagccac tgaagaaggg aaaatacttt tgcctgcaac tatccaatgg 174 0 
aaattactta aaacccttca tcaaaccttt cacttaggca tcgatagcac ccatcagatg 1800 
gccaaatcat tatttactgg accaggrcctt ttcaaaacta tcaagcagat agtcagggcc 186 0 
tgtgaagtgt gccagagaaa taatcccctg ccttatcgcc aagctccttc aggagaacaa 192 0 
agaacaggcc attaccctgg agaagactgg caactgattt tacccacaag cccaaacctc 1980 
agggatttca gtatctacta gtctgggtag atactttcac gggttgggca gaggccttcc 2040 
cctgtaggac agaaaaggcc caagaggtaa taaaggcact agttcatgaa ataattccca 210 0 
gattcggact tccccgaggc ttacagagtg acaatagccc tgctttccag gccacagtaa 2160 
cccagggagt atcccaggcg ttaggtatac gatatcactt acactgcgcc tgaaggccac 222 0 
agtcctcagg gaaggtcgag aaaatgaatg aaacactcaa aggacatcta aaaaagcaaa 2280 
cccaggaaac ccacctcaca tggcctgttc tgttgcctat agccttaaaa agaatctgca 234 0 
actttcccca aaaagcagga cttagcccat acgaaatgct gtatggaagg cccttcataa 24 00 
ccaatgacct tgtgcttgac ccaagacagc caacttagtt gcagacatca cctccttagc 24 60 
caaatatcaa caagttctta aaacattaca aggaacctat ccctgagaag aggaaaagaa 252 0 
tattccaccc aagtgacatg gtattagtca agtcccttcc ctctaattcc ccatccctag 2580 
atacatcctg ggaaggaccc tacccagtca ttttatctac cccaactgcg gttaaagtgg 2640 
ctggagtgga gtcttggata catcacactt gagfccaaatc ctggatactg ccaaaggaac 2700 
ctgaaaatcc aggagacaac gctagctatt cctgtgaacc tctagaggat ttgcgcctgc 2 76 0 
tcttcaaaca acaaccagga ggaaaaatcg aagctgtaaa actacaaatg gagcccaaga 2820 
tgcagtccaa gactaagatc taccgcagac ccctggaccg gcctgttagc ccacgatctg 288 0 
atgttaatga catcaaaggc acccctcctg aggaaatctc agctgcacaa cctctactac 2 940 
gccccaattc agcaggaagc agttagagcg gtcgtcggcc aacctcccca acagcactta 3 0 00 
ggctctcctg ttgagatggg ggactgagag acaggactag ctggatttcc taggctgatt 3 060 
aagaatccct aagcctagct gggaaggtga ccacatccac ctttaaacac ggggcttgca 3120 
acttagctca cacctgacca atcagagagc tcactaaaat gctaattagg caaagacagg 3180 
aggtaaagaa atagccaatc atttattgcc tgagagcaca gcaggaggga caatgatcgg 324 0 
gatataaacc caagttttcg agccggcaac ggcaaccccc tttgggtccc ctccctttgt 3300 
atgggagctc tgttttcatg ctatttcact ctattaaatc ttgcaactgc aaaaaaaaaa 3360 
aaaaaaaaaa aa 3372 



<210> 29 
<211> 2372 
<212> DKA 

<213> Homo sapiens 



<400> 29 
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actgcactct tctggtccat gtttcttacg gctcgagctg agcttttgct. caccgtccac 60 
cacr.gctgtt tgccaccacc gcagacctgc cgctgacccc catccctctg gatcctgcag 120 
ggtgtccgct gtgctcctga Cccagcgagg cgcccattgc cgctcccaat tgggctaaag 180 
gcttgccatit gttcctgcac ggctaagtgc ctgggtttgt tctaattgag ctgaacacta 24 0 
accactgggt tccatggtcc tcttctgcga cccacggctt ctaatagaac tataacactt 300 
accacatggc ccaagattcc attccttgga atccgtgagg ccaagaactc caggtcagag 360 
aatacgaggc ttgccaccat cttggaagcg gcctgctacc gtcttggaag tggttcacca 420 
ccatcttggg agctctgtga gcaaggaccc cccggtaaca ttttggcaac caacgacgga 4 80 
cacccaaagt gatgggaaac gttccccgca agacaaaaac gcccctaaga cgtattctgg 540 
agaattggga ccaatttgac cctcagacac taagaaagaa acgacttata ttcttctgca 600 
gtgccgcctg gcactcctga gggaagtaca aattataaca ccatcttaca gctagacctc 660 
ttttgtagaa aaggcaaatg gagtgaagtg ccataagtac aaactttctt ttcattaaga 720 
gacaactcac aattatgtaa aaagtgcgat ctatgcccta caggaagcct tcagagtcta 780 
cctccctatc ccagcatccc cgactccttc cccaactaat aaggaccccc ctccaaccca 840 
aatggtccaa aaggagatag acaaaagggt aaacagtgaa ccaaagagtg ccaatattcc 900 
ccaattatga cccctccaag cagtgggagg aagagaattc ggcccagcca gagtgcatgt 960 
gcctttttct ctcccagact taaagcaaat aaaaacagac ttaggtaaat tctcagataa 102 0 
ccctgatggc tatattgatg ttttacaagg gttaggacaa ttctttgatc tgacatggag 1080 
agatataatg tcactgctaa atcagacact aaccccaaat gagagaagtg ccaccataac 1140 
tgcagcctga gggtttggcg tctctggtat ctcagtcagg tcaatggata nggatgacaa 1200 
cagaaggaaa ganaatgatt ccccacaggc cagcaggcag ttcccagtct agaccctcat 12 60 
tgggacacag aatcagaaca tggagattgg tgctgcagac atttgctaac ttgtgtgcta 1320 
gaaggactaa ggaaaactag gaagaagtct atgaattact caatgatgtc caccataaca 1380 
cagggaaggg aagaaaatcc tactgccttt ctggagagac taagggaggc attgaggaag 144 0 
cgtgcctctc cgtcacctga ctcttctgaa ggccaactaa tcttaaagcg taagtttatc 1500 
actcagtcag ctgcagacat tagaaaaaac ttcaaaagtc cgccgtaggc ccggagcaaa 1560 
acttagaaac cctattgaac ttggcaacct cggtttttta taatagagat caggaggagc 162 0 
aggcggaaca ggacaaacgg gattaaaaaa aaggccaccg ctttagtcat gaccctcagg 1680 
caagtggact ttggaggctc tggaaaaggg aaaagctggg caaattgaat gcctaatagg 174 0 
gcttgcttcc agtgcggtct acaaggacac tttaaaaaag attgtccaag tagaagtaag 1800 
ccgccccttc gtccatgccc cttatttcaa gggaatcact ggaaggccca ctgccccagg 1860 
ggacaaaggt cttttgagtc agaagccact aaccagatga tccagcagca ggactgaggg 1920 
tgcctggggc aagcgccatc ccatgccatc accctcacag agccctgggt atgcttgacc 1980 
attgagggcc aggaaggttg tctcctggac actggtgcgg tcctcctagt cttactcttc 2 040 
tgtcccggac aactgtcctc cagatctgtc actattctga gggggtccnt aagacgggca 2100 
gtcactagat actttttccc agccactaag ttatgaactg gggagcttta ttcttttcac 2160 
atgcttttct aactatgctt gaaagcccca ctaccttgtt agggagagac attctagcaa 2220 
aagcaggggc cattatacac ctgaacatag gagaaggaac acccgtttgt tgtncccctg 228 0 
cttgaggaag gaattaatcc tgaagtctgg gcaacagaag gacaatatgg acgagccaaa 2340 
gaatgcccgt cctgttcaag ttaaactaaa gg 2372 



<210> 30 
<211> 7582 
<212> DKA. 

<213> Homo sapiens 
<400> 30 

caacaatcgg gacataaacc caggcattcg 
cttccctttg tatgggagct gttttcatgc 
ctcttctggt ccatgtttct tacggctcga 
tgtttgccac caccgcanac ctgccgctga 
cgctgtgctc ctgatccagc gargcgccca 
cattgtncct: gcacggctaa gtgcctgggt 
gggttccatg gttctcttct gtgacccacg 
tggcccaaga ttccattcct tggaatccgt 
gargcttgcc accatcttgg aagcggcctg 
ttgggagctc tgtgagcaag gaccccccgg 



agctggcaac agcagccccr ctttgggtcc 60 
tatttcactc tattaaatct tgcaactgca 120 
gctgagcttt tgctcaccgt ccaccactgc 180 
ctcccatccc tctggatcct gcagggtgtc 24 0 
ttgccgctcc caattgggct aaaggcttgc 3 00 
ttgttctaat tgagctgaac actantcact 360 
gcttctaata Jcaactataac acttaccaca 42 0 
gaggscaacg aactccaggt cagagaatac 480 
ctaccrtctt ggaagtggtt caccaccatc 540 
tracattttg gcraccamsr acggacatcc 600 
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maagtgatgg gaaacgntcc ccgcaagaca aaaacgcccc taagacgtat tctggaraat 660 
ngggamcaat ttgaccctca gacactaaga aagaaacgac ttatattctt ctgcagtgcc 720 
gcctggcact cctgagggaa gtataaatta taacaccatc ttacagctag acytcttttg 780 
tagaaaaggc aaatggagtg aagtgccata agcacaaact ttcttttcat taagagacaa 840 
ctcacaatta tgcaaaaagt gtgatttatg ccctacagga agccttcaga gtctaccccc 900 
ctatcccagc atccccgact ccttccccam ytaataagga ccccccttca acccaaatgg 960 
tccaaaagga gaiagacaaa agggtaaaca gtgaaccaaa gagtgccaat attccccaat 1020 
tatgacccct cccaagcagt gggaggaaga gaattcggcc cagccagagt gcatgcgcyt 1080 
tzttyytctcc cagactcaaa gcaaataaaa acagacttag gtaaattctc agataaycct 114 0 
gatggctata ttgrtgtttt acaagggtta ggacaattct ttgatctgac atggagagat 1200 
atatatgtca ctgctaaatc agacactaac cccaaatgag agaagtgcca ccataactgc 1260 
agcctgagrg tttggcgatc tctggtatct cagtcaggtc aatggatang gatgacaaca 1320 
gaaggaaaga naatgattcc ccacaggcca gcargcagtt cccagtctas accctcattg 1380 
gggacacaga aatcagtaac atgggagatt ggtgctgcag acatttgcta actcgtgtgc 144 0 
tasaaggact aaggaaaact asgaagaaar tctaygaatt actcaatgat gtccaccata 1500 
acacagggga agggaagaaa atcctactgc ctttctggag agactaaggg aggcattgag 1560 
gaagcgtgcc tctctgtcac ctgactcttc tgaaggccaa ctaatcttaa agcgtaagtt 1620 
catcactcag tcagctgcag acattagaaa aaacttcaaa agfcctgccgt aggcccggag 1680 
caaaacttag aaaccctatt gaactcggca acytcggttt tttataatag agatcaggag 174 c 
gagcaggcgg aacaggacaa acgggattaa aaaaaaggcc accgctttag tcatgaccct 1800 
caggcaagtg gactttggag gctctggaaa agggaaaagc tgggcaaatt gaatgcctaa 1860 
ta 9ggcttgc ttccagtgcg gtctacaagg acactttaaa aaagattgtc caagtagaag 192 0 
taagccgccc cttcgtccat gccccttatt tcaagggaac cactggaagg cccactgccc 1980 
caggggacaa aggtcttttg agtcagaagc cactaaccag atgatccagc agcaggactg 204 0 
agggtgcctg gggcaagcgc catcccatgc caccaccctc acagagccct gggtatgctt 210 0 
gaccattgag ggccaggaag gttgtctcct ggacactggt gcggtcttct tagtctcact 2160 
cttctgtccc ggacaactgt cctccagatc tgtcactatt ctgagggggt ccntaagacg 2220 
ggcagtcact agatacttty tcccagccac taagttatga actggggagc tttattcttt 22B0 
tcacatgctt tcctaattat gcttgaaagc cccaccacc: tgttagggag agacattcta 2340 
gcaaaagcag gggccattat acacctgaac ataggagaag gaacacccgt ttgttgtncc 24 00 
cctgcttgag gaaggaacta atcctgaagt ctgggcaaca gaaggacaat atggacgagc 24 6 0 
caaagaatgc ccgtcctgtt caagttaaac caaaggattc cacttccttt ccctaccaaa 2520 
ggcagtaccc cctcagaccc aaggcccaac aaggattcca aaagattgtt aaggacttaa 2580 
aagcccaagg cttagtaaaa ccatgcataa ctccctgcag taattccgta gtggattgag 2640 
gaggcacaga aacccagtgg acagtggagg gttagtgcaa gatctcagga ttatcaatgg 2700 
aggccgttgt ccttttatac ccagctgtac ctagccctca tactgtgmyt tcccaaatac 2760 
cagaggaagc agagtggttt acastcctgg accttmagga tgccttcttc tgcatccctg 282 0 
tacatcctga ctctcaattc ttgtttgcct ttgaagatac ttcaaaccca rcatctcaac 2880 
tcacctggac trttttaccc caagggttca gggatagycc ccatctattt ggccaggcat 294 0 
tagcccaaga cttgagycar tymtcatacc tggacactct tgtccttcrg takgtggatg 3 000 
atttactttt rgcygccyrt tcagaaacct tgtgccatca agccacccaa gcrctcttma 3 060 
atttcctcgc yacctgtggc tacawggttt ccaaacsara rgctcarctc tgctcacagc 312 0 
aggttaaata cttaggrcta arattatcca aaggcaccar ggccctcagt gaggaayrya 3180 
tccagcctat actggcttat cctcatcyca aaaccctaaa gcaactaagr grrttccttg 324 0 
gcrtaayagg yttctgccga awatggattc cccaggtwtg gcraaatagc caggycatta 3300 
watacastaa ttaaggaaac tcagaaagcc aatacccatt tartaagatg gayamctgaa 3360 
gymraagtgg ctttccaggc ccctaaagaa ggccttaaac ccaagyccca gtgttaagyt 3420 
tgccaacrgg gcaagacttt tstityatayr tcacagaaaa aaacagraay agctctrgga 34 8 0 
gtccttacac agrtccragg gaygagcttg caaccyrtgg cryacctgas taaggaaayt 3540 
gatgtagtgg caaagggttg rcytcattgt ttaygggtag tggtggcagt agcagtyJcta 3600 
gtatctgaag cagttaaaat aatacagggr agagatctta ctgtgtggac atctcatgak 3660 
gtgaayrgca tactcactgc taaaggagac ttgtggctgt cagacaacyg tttacttaaa 3720 
trtcaggctc tattacttga arggccagtg ctgcractgt gcacttgtgc aactcttaac 3780 
ccagycncat ttcttccaga caatgaagaa aagataraay ataactgtca acaartaatt 3840 
tctcaaacct atgccactcg aggggacctt ytagargttc cyttgactga tccygacctt 3 900 
caacttgtat actgatggaa gttcctttgt agaaaaagga cttcgaaaag yggggtatgc 3 960 
a 9tggtcagt gataatggaa tayttgaaag taatcccctc actccaggaa ctagtgctya 4 020 
gctrgcagaa ctaatagccy tcaytkgggc actagaatta ggagaagraa aaagggyaaa 4 080 
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tatatataca gactctrart atgctyacct agtcntccat gcccatgmrg caatatgsar 4140 
agaaagggaa ttcctaactt cygagrgaac acctatcama catcaggaag ccattaggar 4200 
attattaytg gcwgtacaga aacctaraga ggtggmagtc ttacactgcy ggggtcatca 4260 
naaaggaaag raaagggaaa tasaagrgaa ytgccaagca katattgaag cmaaaagagc 4320 
tgcaaggcag gaccctccat tagaaatgct tattaaactt cccntagtat agggtaatcc 4380 
cttccgggaa accaagcccc agtactcagc aggagaaaca gaatggggaa cctcacgagg 444 0 
cagttttctc ccctcgggac ggttagccac tgaagaaggg aaaatacttt tgcctgcaac 4500 
tatccaatgg aaattactta aaacccttca tcaaaccttt cacttaggca tcgatagcac 4560 
ccatcaratg gccaaatcat tatttactgg accaggcctt ttcaaaacta tcaagearat 4620 
aktcagggcc tgtgaaktgt gccararaaa taatcccctg cctyatcgcc aagctccttc 4680 
aggaraacaa araacaggcc attaccctgr araaractgg caactgattt tacccacaag 474 0 
cccaaacctc agggatttca gtatctacta gtctgggtar atactttcac gggttgggca 4800 
raggccttcc cctgtaggac agaaaaggcc caagaggtaa taaaggcact agttcatgaa 4860 
ataattccca gattcggact tccccgaggc ttacagagtg acaatagccc tgctttccag 4920 
gccacagtaa cccagggagt atcccaggcg ttaggtatac gatatcact^ acactgcgcc 4980 
tgaaggccac agtcctcagg gaaggtcgag aaaatgaatg aaayactcaa aggacatcta 5040 
aaaaagcaaa cccaggaaac ccacctcaca tggcctgytc tgttgcctat agccttaaaa 5100 
agaatctgca actttcccca aaaagcacga cctagcccat acgaaatgct gtatggaagg 5160 
cccttcataa ccaatgacct tgtgcttgac ccaagacagc caacttagtt gcagacatca 522 0 
cctccttagc caaatatcaa caagttctta aaacattaca aggaacctac ccctgagaag 5280 
agggaaaaga actattccac ccwwgtgaca tggtattagt caagtccctt cyctccaatt 5340 
ccccatccct agatacatcc tgggaaggac cctacccagt cattttatyt accccaactg 54 00 
cggttaaagt ggcfcggagtg gagtcttgga tacatcacac ttgagtcaaa tcctggatac 5460 
tgccaaagga acctgaaaat ccaggagaca acgcnagcta ttcctgtgaa cctctagagg 5520 
atttgcgcct gctcttcaaa caacaaccag gaggaaagta actaaaatca taaatccccc 5580 
atggscctcc cttatcatat ttttctctkt astgttsttt yaccctsttt cactctcact 5640 
gcaccccctc catgccgctg tatgaccagt agctccccty accmagagtt tctatggaga 570 0 
atgcagcgtc ccggaaatat tgatgcccca tcgtatagga gtctttstaa gggaaccccc 5760 
accttcactg cccacaccca tatgccccgc aactgctatc actctgccac tctttgcatg 5820 
catgcaaata ctcattattg gacaggaaaa atgattaatc ctagttgtcc tggaggactt 5880 
ggagtcactg tctgttggac ttacttcacc caaactggta tgtctgatgg gggtggagtt 594 0 
caagatcagg caagagaaaa acatgtaaaa gaagtaatct cccaactcac csgggtacat 6000 
ggcacctcta gcccctacaa aggactagat ctctcaaaac tacatgaaac cctccgtacc 6060 
catactcgcc tggtaagcct atttaatacc accctcactg ggctccatga ggtctcggcc 6120 
caaaacccta ctaactgttg gatatgcctc cccctgaact tcargccata tgtttcaatc 6180 
cctgtacctg aacaatggaa caacttcagc acagaaataa acaccacttc cgttttagta 6240 
ggacctcttg tttccaatst ggaaataacc catacctcaa acctcacctg tgtaaaattt 6300 
agcaatacta catacacaac caactcccaa tgcatcaggt gggtaactcc tcccacacaa 6360 
atagtctgcc taccctcagg aatatttttt gtctgtggta cctcagccta tcgttgtttg 6420 
aatggctctt cagaatctat gtgcttcctc tcattcttag tgcccccyat grccatctac 6480 
actgaacaag atttatacag ttatgtcata tctaagcccc gcaacaaaag agtacccatt 654 0 
cttccttttg ttataggagc aggagtgcta ggtgcactag gtactggcat tggcggtatc 6600 
acaacctcta ctcagttcta ctacaaacta tctcaagaac taaatgggga catggaacgg 6660 
gtcgccgact ccctggtcac cttgcaagat caacttaact ccctagcagc agtagtcctt 6720 
craaatcgaa gagctttaga ctygctaacc gctgaragag ggggaacctg tttattttta 6780 
ggggaagaat gctgttatta tgttaatcaa tccggaatcg tcactgagaa agttraagaa 684 0 
attcsagatc gaatacaacg takagcagar gagcttcgaa acactggacc ctggggcctc 6900 
ctcagccrat ggatgccctg gattctcccc ttcttaggac ctctagcagc tataatattg 6960 
ctactcctct ttggaccctg tatctttrac ctccttgtta actttgtctc ttccagaatc 7020 
gaagctgtra aactacaaat ggagcccaag atgcagtcca agactaagat ctaccgcaga 7080 
cccctggacc ggcctgytag cccacgatct gatgttaatg acatcaaagg cacccctcct 7140 
gaggaaatct cagctgcaca acctctacta cgccccaatt cagcaggaag cagttagagc 7200 
ggtsgtcggc caacctcccc aacagcactt aggttttcct gttgagatgg gggactgaga 7260 
gacaggacta gctggatttc ctaggctgay taagaatccy taagcctags tgggaaggtg 7320 
accacatcca cctttaaaca cggggcttgc aacttagytc acacctgacc aatcagagag 7380 
ctcactaaaa tgctaattag gcaaagacag gaggtaaaga aatagccaat catytattgc 7440 
mtgagagcac agcaggaggg acaatgatcg ggatataaac ccaagtyttc gagccggcaa 7500 
cggcaacccc ctttgggtce cctccctttg tatgggagct ctgttttcat gctatttcac 7560 
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tctattaaat cttgcarctg cr 



<^210> 31 

<211> 363 

<212> PRT 

<213> Homo sapiens 



<400> 31 

Met Gly Asn lie 

1 

Glu Asn Trp Asp 
20 

lie Phe Phe Cys 
35 

Glu Thr Trp Leu 
50 



Pro Pro Lys Ala 
5 

Gin Cys Asp Thr 

Ser Thr Ala Trp 
40 

Pro Glu Gly Ser 
55 



Lys Thr Pro Leu 

10 

Gin Thr Leu Arg 
25 

Pro Gin Tyr Pro 



lie Asn Tyr Asn 
60 



Arg Cys He Leu 
15 

Lys Lys Arg Phe 

30 

Leu Gin Gly Arg 
45 

He He Leu Gin 



Leu Asp Leu Phe Cys Arg Lys Glu 
65 70 

Val Gin Thr Phe Phe Ser Leu Arg 
85 

Cys Gly Leu Cys Pro Thr Gly Ser 
100 

Ser Val Pro Pro Pro Thr Pro Ser 
115 120 

Thr Gin Thr Val Gin Lys Glu He 
130 135 



Gly Lys Trp Ser Glu val Pro Tyr 
75 60 

Asp Asn Ser Gin Leu Cys Lys Lys 
90 95 

Pro Gin Ser Pro Pro Pro Tyr Pro 
105 110 

Ser Thr Asn Lys Asp Pro Pro Leu 
125 

Asp Lys Gly Val Asn Asn Glu Pro 
14 0 



Lys Ser Ala Asn 
145 

Gly Glu Phe Gly 



Leu Lys Gin He 
180 

Gly Tyr He Asp 
195 

Trp Arg Asp He 
210 



Arg Ser Ala Ala 
225 

Leu Ser Gin Val 



Thr Gly Gin Gin 



He Pro Arg Leu 
15 0 

Pro Ala Arg Val 
165 

Lys He Asp Leu 



Val Leu Gin Gly 
200 

Met Leu Leu Leu 
215 

Val Thr Ala Ala 
230 

Asn Asn Arg Met 
245 

Ala Val Pro Ser 



Cys Pro Leu Gin 
155 

Pro Val Pro Phe 
170 

Gly Lys Phe Ser 
185 

Leu Gly Gin Ser 



Asn Gin Thr Leu 
220 

Arg Glu Phe Gly 
235 

Thr Thr Glu Glu 
250 

Val Asp Pro His 



Ala Val Arg Gly 
160 

Ser Leu Ser Asp 
175 

Asp Asn Pro Asp 
190 

Phe Asp Leu Thr 
205 

Thr pro Asn Glu 



Asp Leu Trp Tyr 
240 

Arg Thr Thr Pro 
255 

Trp Asp Thr Glu 
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260 

Ser Glu His Gly 
275 

Glu Gly Leu Arg 
290 

Ser Thr lie Thr 
305 

Arg Leu Arg Glu 



He Glu Gly Gin 
340 



Asp Trp Cys His 
280 

Lys Thr Arg Lys 
295 

Gin Gly Lys Glu 

310 

Ala Leu Arg Lys 
325 

Leu He Leu Lys 



265 

Lys His Leu Leu 



Lys Pro Met Asn 

300 

Glu Asn Pro Thr 
315 

His Thr Ser Leu 
330 

Asp Lys Phe He 
345 



270 

Thr Cys Val Leu 
285 

Tyr Ser Met Met 



Ala Phe Leu Asp 
320 

Ser Pro Asp Ser 
335 

Thr Gin Ser A] a 
350 



Ala Asp" He Arg Lys Asn Phe Lys Ser Leu Pro 
355 360 



